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(57) Abstract 



Cancer associated antigens have been identified by autologous antibody screening of libraries of nucleic acids expressed in renal cancer 
cells using antisera from cancer patients. The invention relates to nucleic acids and encoded polypeptides which are cancer associated antigens 
expressed in patients afflicted with renal cancer. The invention provides, inter alia, isolated nucleic acid molecules, expression vectors 
containing those molecules and host cells transfected with those molecules. The invention also provides isolated proteins and peptides 
antibodies to those proteins and peptides and cytotoxic T lymphocytes which recognize the proteins and peptides. Fragments of the 
foregoing including functional fragments and variants also are provided. Kits containing the foregoing molecules additionally are provided. 
The molecules provided by the invention can be used in the diagnosis, monitoring, research, or treatment of conditions characterized by the 
expression of one or more cancer associated antigens. 
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Summary of the In vent inn 

Autologous antibody screening has now been applied to renal cancer using antisera 
from cancer patients. Numerous cancer associated antigens have been identified. The 
invention provides, inter alia, isolated nucleic acid molecules, expression vectors containing 
those molecules and host cells transacted with those molecules. The invention also provides 
isolated proteins and peptides, antibodies to those proteins and peptides and CTLs which 
recognize the proteins and peptides. Fragments including functional fragments and variants of 
the foregoing also are provided. Kits containing the foregoing molecules additionally are 
provided. The foregoing can be used in the diagnosis, monitoring, research, or treatment of 
conditions characterized by the expression of one or more cancer associated antigens. 

Prior to the present invention, only a handful of cancer associated genes had been 
identified in the past 20 years. The invention involves the surprising discovery of several 
genes, some previously known and some previously unknown, which are expressed in 
individuals who have cancer. These individuals all have serum antibodies against the proteins 
(or fragments thereof) encoded by these genes. Thus, abnormally expressed genes are 
recognized by the host's immune system and therefore can form a basis for diagnosis, 
monitoring and therapy. 

The invention involves the use of a single material, a plurality of different materials 
and even large panels and combinations of materials. For example, a single gene, a single 
protein encoded by a gene, a single functional fragment thereof, a single antibody thereto, etc. 
can be used in methods and products of the invention. Likewise, pairs, groups and even 
panels of these materials and optionally other cancer associated antigen genes and/or gene 
products can be used for diagnosis, monitoring and therapy. The pairs, groups or panels can 
involve 2, 3, 4, 5 or more genes, gene products, fragments thereof or agents that recognize 
such materials. A plurality of such materials are not only useful in monitoring, typing, 
characterizing and diagnosing cells abnormally expressing such genes, but a plurality of such 
materials can be used therapeutically. An example of the use of a plurality of such materials 
for the prevention, delay of onset, amelioration, etc. of cancer cells, which express or will 
express such genes prophylactically or acutely . Any and all combinations of the genes, gene 
products, and materials which recognize the genes and gene products can be tested and 
identified for use according to the invention. It would be far too lengthy to recite all such 
combinations; those skilled in the art, particularly in view of the teaching contained herein, 
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CANCER ASSOCIATED ANTIGENS AND USES THEREFOR 

Field of the Invention 

The invention relates to nucleic acids and encoded polypeptides which are cancer 
associated antigens expressed in patients afflicted with a variety of cancers. The invention 
also relates to agents which bind the nucleic acids or polypeptides. The nucleic acid 
molecules, polypeptides coded for by such molecules and peptides derived therefrom, as well 
as related antibodies and cytolytic T lymphocytes, are useful, inter alia, in diagnostic and 
therapeutic contexts. 

Background of the Invention 

The mechanism by which T cells recognize foreign materials has been implicated in 
cancer. A number of cytolytic T lymphocyte (CTL) clones directed against autologous 
melanoma antigens, testicular antigens, and melanocyte differentiation antigens have been 
described. In many instances, the antigens recognized by these clones have been 
characterized. 

The use of autologous CTLs for identifying tumor antigens requires that the target 
cells which express the antigens can be cultured in vitro and that stable lines of autologous 
CTL clones which recognize the antigen-expressing cells can be isolated and propagated. 
While this approach has worked well for melanoma antigens, other tumor types, such as 
epithelial cancers including breast and colon cancer, have proved refractory to the approach. 

More recently another approach to the problem has been described by Sahin et al. 
(Proc. Natl. Acad. Sci. USA 92:11810-11813, 1995). According to this approach, autologous 
antisera are used to identify immunogenic protein antigens expressed in cancer cells by 
screening expression libraries constructed from tumor cell cDNA. Antigen-encoding clones 
so identified have been found to have elicited an high-titer humoral immune response in the 
patients from which the antisera were obtained. Such a high-titer IgG response implies helper 
T cell recognition of the detected antigen. These tumor antigens can then be screened for the 
presence of MHC/HLA class I and class II motifs and reactivity with CTLs 

Presently there is a need for additional cancer antigens for development of therapeutics 
and diagnosis applicable to a greater number of cancer patients having various cancers. 
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will readily be able to determine which combinations are most appropriate for which 
circumstances. 

As will be clear from the following discussion, the invention has in vivo and in vitro 
uses, including for therapeutic, diagnostic, monitoring and research purposes. One aspect of 
the invention is the ability to fingerprint a cell expressing a number of the genes identified 
according to the invention by, for example, quantifying the expression of such gene products. 
Such fingerprints will be characteristic, for example, of the stage of the cancer, the type of the 
cancer, or even the effect in animal models of a therapy on a cancer. Cells also can be 
screened to determine whether such cells abnormally express the genes identified according to 
the invention. 

The invention, in one aspect, is a method of diagnosing a disorder characterized by 
expression of a cancer associated antigen precursor coded for by a nucleic acid molecule. The 
method involves the steps of contacting a biological sample isolated from a subject with an 
agent that specifically binds to the nucleic acid molecule, an expression product thereof, or a 
fragment of an expression product thereof complexed with an MHC, preferably an HLA, 
molecule, wherein the nucleic acid molecule is aNA Group 1 nucleic acid molecule, and 
determining the interaction between the agent and the nucleic acid molecule, the expression 
product or fragment of the expression product as a determination of the disorder. 

In one embodiment the agent is selected from the group consisting of (a) a nucleic 
acid molecule comprising NA Group 1 nucleic acid molecules or a fragment thereof, (b) a 
nucleic acid molecule comprising NA Group 3 nucleic acid molecules or a fragment thereof, 
(c) a nucleic acid molecule comprising NA Group 5 nucleic acid molecules or a fragment 
thereof, (d) an antibody that binds to an expression product, or a fragment thereof, of NA 
group 1 nucleic acids, (e) an antibody that binds to an expression product, or a fragment 
thereof, of NA group 3 nucleic acids, (f) an antibody that binds to an expression product, or a 
fragment thereof, of NA group 5 nucleic acids, (g) and agent that binds to a complex of an 
MHC, preferably HLA, molecule and a fragment of an expression product of a NA Group 1 
nucleic acid, (h) an agent that binds to a complex of an MHC, preferably HLA, molecule and 
a fragment of an expression product of a NA group 3 nucleic acid, and (i) an agent that binds 
to a complex of an MHC, preferably HLA, molecule and a fragment of an expression product 
of aNA Group 5 nucleic acid. 

The disorder may be characterized by expression of a plurality of cancer associated 
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antigen precursors. Thus the methods of diagnosis may include use of a plurality of agents, 
each of which is specific for a different human cancer associated antigen precursor (including 
at least one of the cancer associated antigen precursors disclosed herein), and wherein said 
plurality of agents is at least 2, at least 3, at least 4, at least 5, at least 6, at least 7, at least 8, at 
least 9 or at least 10 such agents. 

In each of the above embodiments the agent may be specific for a human cancer 
associated antigen precursor that is selected from the group consisting of breast, gastric, lung, 
prostate, renal, colon, thyroid, Hodgkin's disease, and hepatocarcinoma cancer associated 
antigen precursors. 

In another aspect the invention is a method for determining regression, progression or 
onset of a condition characterized by expression of abnormal levels of a protein encoded by a 
nucleic acid molecule that is a NA Group 1 molecule. The method involves the steps of 
monitoring a sample, from a subject who has or is suspected of having the condition, for a 
parameter selected from the group consisting of (i) the protein, (ii) a peptide derived from the 
protein, (iii) an antibody which selectively binds the protein or peptide, and (iv) cytolytic T 
cells specific for a complex of the peptide derived from the protein and an MHC molecule, as 
a determination of regression, progression or onset of said condition. In one embodiment the 
sample is a body fluid, a body effusion or a tissue. 

In another embodiment the step of monitoring comprises contacting the sample with a 
detectable agent selected from the group consisting of (a) an antibody which selectively binds 
the protein of (i), or the peptide of (ii), (b) a protein or peptide which binds the antibody of 
(iii), and (c) a cell which presents the complex of the peptide and MHC molecule of (iv). In a 
preferred embodiment the antibody, the protein, the peptide or the cell is labeled with a 
radioactive label or an enzyme. The sample in a preferred embodiment is assayed for the 
peptide. 

According to another embodiment the nucleic acid molecule is one of the following: a 
NA Group 3 molecule or a NA Group 5 molecule. In yet another embodiment the protein is a 
plurality of proteins, the parameter is a plurality of parameters, each of the plurality of 
parameters being specific for a different of the plurality of proteins. In certain embodiments 
the protein is a plurality of proteins, at least one of which is kinectin, the remainder of which 
are non-kinectin cancer associated proteins, and wherein the parameter is a plurality of 
parameters, each of the plurality of parameters being specific for a different of the plurality of 
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proteins. 

The invention in another aspect is a pharmaceutical preparation for a human subject. 
The pharmaceutical preparation includes an agent which when administered to the subject 
enriches selectively the presence of complexes of an HLA molecule and a human cancer 
associated antigen, and a pharmaceutical^ acceptable carrier, wherein the human cancer 
associated antigen is a fragment of a human cancer associated antigen precursor encoded by a 
nucleic acid molecule which comprises a NA Group 1 molecule. In one embodiment the 
nucleic acid molecule is a NA Group 3 nucleic acid molecule. 

The agent in one embodiment comprises a plurality of agents, each of which enriches 
selectively in the subject complexes of an HLA molecule and a different human cancer 
associated antigen. Preferably the plurality is at least two, at least three, at least four or at 
least 5 different such agents. 

In certain embodiments, the agent comprises a plurality of agents, at least one of 
which is kinectin, the remainder of which are non-kinectin cancer associated proteins, and 
each of which enriches selectively in the subject complexes of an HLA molecule and a 
different human cancer associated antigen. 

In another embodiment the agent is selected from the group consisting of (1) an 
isolated polypeptide comprising the human cancer associated antigen, or a functional variant 
thereof, (2) an isolated nucleic acid operably linked to a promoter for expressing the isolated 
polypeptide, or functional variant thereof, (3) a host cell expressing the isolated polypeptide, 
or functional variant thereof, and (4) isolated complexes of the polypeptide, or functional 
variants thereof, and an HLA molecule. 

The agent may be a cell expressing an isolated polypeptide. In one embodiment the 
agent is a cell expressing an isolated polypeptide comprising the human cancer associated 
antigen or a functional variant thereof. In another embodiment the agent is a cell expressing 
an isolated polypeptide comprising the human cancer associated antigen or a functional 
variant thereof, and wherein the cell expresses an HLA molecule that binds the polypeptide. 
The cell can express one or both of the polypeptide and HLA molecule recombinantly. In 
preferred embodiments the cell is nonproliferative. In other preferred embodiments, the 
isolated polypeptide is or includes a kinectin polypeptide. In yet another embodiment the 
agent is at least two, at least three, at least four or at least five different polypeptides, each 
representing a different human cancer associated antigen or functional variant thereof. 
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The agent in one embodiment is a PP Group 2 polypeptide. In other embodiments the 
agent is a PP Group 3 polypeptide or a PP Group 4 polypeptide. 

In an embodiment each of the pharmaceutical preparations described herein also 
includes an adjuvant. 

According to another aspect the invention, a composition is provided which includes 
an isolated agent that binds selectively a PP Group 1 polypeptide. In separate embodiments 
the agent binds selectively to a polypeptide selected from the following: a PP Group 2 
polypeptide, a PP Group 3 polypeptide, a PP Group 4 polypeptide, and a PP Group 5 
polypeptide. In other embodiments, the agent is a plurality of different agents that bind 
selectively at least two, at least three, at least four, or at least five different such polypeptides. 
In each of the above described embodiments the agent may be an antibody. In a preferred 
embodiment, at least one of polypeptides is kinectin or a fragment thereof. 

In another aspect the invention is a composition of matter composed of a conjugate of 
the agent of the above-described compositions of the invention and a therapeutic or diagnostic 
agent. Preferably the conjugate is of the agent and a therapeutic or diagnostic that is an 
antineoplastic. 

The invention in another aspect is a pharmaceutical composition which includes an 
isolated nucleic acid molecule selected from the group consisting of: (1) NA Group 1 
molecules, and (2) NA Group 2 molecules, and a pharmaceutical^ acceptable carrier. In one 
embodiment the isolated nucleic acid molecule comprises a NA Group 3 or NA Group 4 
molecule. In another embodiment the isolated nucleic acid molecule comprises at least two 
isolated nucleic acid molecules coding for two different polypeptides, each polypeptide 
comprising a different cancer associated antigen. In preferred embodiments, at least one of 
the polypeptides is a kinectin polypeptide. 

Preferably the pharmaceutical composition also includes an expression vector with a 
promoter operably linked to the isolated nucleic acid molecule. In another embodiment the 
pharmaceutical composition also includes a host cell recombinantly expressing the isolated 
nucleic acid molecule. 

According to another aspect of the invention a pharmaceutical composition is 
provided. The pharmaceutical composition includes an isolated polypeptide comprising a PP 
Group 1 or a PP Group 2 polypeptide, and a pharmaceutical ly acceptable carrier. In one 
embodiment the isolated polypeptide comprises a PP Group 3 or a PP Group 4 polypeptide. 
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In another embodiment the isolated polypeptide comprises at least two different 
polypeptides, each comprising a different cancer associated antigen at least one of which is 
encoded by a NA group 1 molecule as disclosed herein. In separate embodiments the isolated 
polypeptides are selected from the following: PP Group 3 polypeptides or HLA binding 
fragments thereof and PP Group 5 polypeptides or HLA binding fragments thereof. 

In an embodiment each of the pharmaceutical compositions described herein also 
includes an adjuvant. 

Another aspect the invention is an isolated nucleic acid molecule comprising a NA 
Group 3 molecule. Another aspect the invention is an isolated nucleic acid molecule 
comprising a NA Group 4 molecule. 

The invention in another aspect is an isolated nucleic acid molecule selected from the 
group consisting of (a) a fragment of a nucleic acid selected from the group of nucleic acid 
molecules consisting of SEQ ID Nos numbered below and comprising all nucleic acid 
sequences among SEQ ID NOs 1-1 1 and 22-46, of sufficient length to represent a sequence 
unique within the human genome, and identifying a nucleic acid encoding a human cancer 
associated antigen precursor, (b) complements of (a), provided that the fragment includes a 
sequence of contiguous nucleotides which is not identical to any sequence selected from the 
sequence group consisting of (1) sequences having the GenBank accession numbers of Table 
1, (2) complements of (1), and (3) fragments of (1) and (2). 

In one embodiment the sequence of contiguous nucleotides is selected from the group 
consisting of: (1) at least two contiguous nucleotides nonidentical to the sequences in Table 1, 
(2) at least three contiguous nucleotides nonidentical to the sequences in Table 1, (3) at least 
four contiguous nucleotides nonidentical to the sequences in Table 1, (4) at least five 
contiguous nucleotides nonidentical to the sequences in Table 1 , (5) at least six contiguous 
nucleotides nonidentical to the sequences in Table 1, or (6) at least seven contiguous 
nucleotides nonidentical to the sequences in Table 1 . 

In another embodiment the fragment has a size selected from the group consisting of at 
least: 8 nucleotides, 1 0 nucleotides, 12 nucleotides, 14 nucleotides, 1 6 nucleotides, 18 
nucleotides, 20, nucleotides, 22 nucleotides, 24 nucleotides, 26 nucleotides, 28 nucleotides, 30 
nucleotides, 50 nucleotides, 75 nucleotides, 1 00 nucleotides, 200 nucleotides, 1000 
nucleotides and every integer length therebetween. 

In yet another embodiment the molecule encodes a polypeptide which, or a fragment 
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of which, binds a human HLA receptor or a human antibody. 

Another aspect of the invention is an expression vector comprising an isolated nucleic 
acid molecule of the invention described above operably linked to a promoter. 

According to one aspect the invention is an expression vector comprising a nucleic 
acid operably linked to a promoter, wherein the nucleic acid is a NA Group 1 or Group 2 
molecule. In another aspect the invention is an expression vector comprising a NA Group 1 
or Group 2 molecule and a nucleic acid encoding an MHC, preferably HLA, molecule. 

In yet another aspect the invention is a host cell transformed or transfected with an 
expression vector of the invention described above. 

In another aspect the invention is a host cell transformed or transfected with an 
expression vector comprising an isolated nucleic acid molecule of the invention described 
above operably linked to a promoter, or an expression vector comprising a nucleic acid 
operably linked to a promoter, wherein the nucleic acid is a NA Group 1 or 2 molecule and 
further comprising a nucleic acid encoding HLA. 

According to another aspect of the invention an isolated polypeptide encoded by the isolated 
nucleic acid molecules the invention, described above, is provided. These include PP Group 
1-5 polypeptides. The invention also includes a fragment of the polypeptide which is 
immunogenic. In one embodiment the fragment, or a portion of the fragment, binds HLA or a 
human antibody. In still another aspect the invention provides as isolated polypeptide 
comprising a fragment of a kinectin polypeptide which is immunogenic. 

The invention includes in another aspect an isolated fragment of a human cancer 
associated antigen precursor which, or portion of which, binds HLA or a human antibody, 
wherein the precursor is encoded by a nucleic acid molecule that is a NA Group 1 molecule. 
In one embodiment the fragment is part of a complex with HLA. In another embodiment the 
fragment is between 8 and 12 amino acids in length. In another embodiment the invention 
includes an isolated polypeptide comprising a fragment of the polypeptide of sufficient length 
to represent a sequence unique within the human genome and identifying a polypeptide that is 
a human cancer associated antigen precursor. 

According to another aspect of the invention a kit for detecting the presence of the 
expression of a cancer associated antigen precursor is provided. The kit includes a pair of 
isolated nucleic acid molecules each of which consists essentially of a molecule selected from 
the group consisting of (a) a 12-32 nucleotide contiguous segment of the nucleotide sequence 
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of any of the NA Group 1 molecules and (b) complements of ("a"), wherein the contiguous 
segments are nonoverlapping. In one embodiment the pair of isolated nucleic acid molecules 
is constructed and arranged to selectively amplify an isolated nucleic acid molecule that is a 
NA Group 3 molecule. Preferably, the pair amplifies a human NA Group 3 molecule. 

According to another aspect of the invention a method for treating a subject with a 
disorder characterized by expression of a human cancer associated antigen precursor is 
provided. The method includes the step of administering to the subject an amount of an agent, 
which enriches selectively in the subject the presence of complexes of an HLA molecule and a 
human cancer associated antigen, effective to ameliorate the disorder, wherein the human 
cancer associated antigen is a fragment of a human cancer associated antigen precursor 
encoded by a nucleic acid molecule selected from the group consisting of (a) a nucleic acid 
molecule comprising NA group 1 nucleic acid molecules, (b) a nucleic acid molecule 
comprising NA group 3 nucleic acid molecules, (c) a nucleic acid molecule comprising NA 
group 5 nucleic acid molecules. 

In one embodiment the disorder is characterized by expression of a plurality of human 
cancer associated antigen precursors and wherein the agent is a plurality of agents, each of 
which enriches selectively in the subject the presence of complexes of an HLA molecule and a 
different human cancer associated antigen. Preferably the plurality is at least 2, at least 3, at 
least 4, or at least 5 such agents. In a preferred embodiment, at least one of the human cancer 
associated antigens is kinectin or a fragment thereof. 

In another embodiment the agent is an isolated polypeptide selected from the group 
consisting of PP Group 1, PP Group 2, PP Group 3, PP Group 4, and PP group 5 polypeptides. 

In yet another embodiment the disorder is cancer. 

According to another aspect the invention is a method for treating a subject having a 
condition characterized by expression of a cancer associated antigen precursor in cells of the 
subject. The method includes the steps of (i) removing an immunoreactive cell containing 
sample from the subject, (ii) contacting the immunoreactive cell containing sample to the host 
cell under conditions favoring production of cytolytic T cells against a human cancer 
associated antigen which is a fragment of the precursor, (iii) introducing the cytolytic T cells 
to the subject in an amount effective to lyse cells which express the human cancer associated 
antigen, wherein the host cell is transformed or transfected with an expression vector 
comprising an isolated nucleic acid molecule operably linked to a promoter, the isolated 
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nucleic acid molecule being selected from the group of nucleic acid molecules consisting of 
NA Group 1, NA Group 2, NA Group 3, NA Group 4, NA Group 5. 

In one embodiment the host cell recombinantly expresses an HLA molecule which 
binds the human cancer associated antigen. In another embodiment the host cell 
endogenously expresses an HLA molecule which binds the human cancer associated antigen. 

The invention includes in another aspect a method for treating a subject having a 
condition characterized by expression of a cancer associated antigen precursor in cells of the 
subject. The method includes the steps of (i) identifying a nucleic acid molecule expressed by 
the cells associated with said condition, wherein said nucleic acid molecule is a NA Group 1 
molecule (ii) transfecting a host cell with a nucleic acid selected from the group consisting of 
(a) the nucleic acid molecule identified, (b) a fragment of the nucleic acid identified which 
includes a segment coding for a cancer associated antigen, (c) deletions, substitutions or 
additions to (a) or (b), and (d) degenerates of (a), (b), or (c); (iii) culturing said transfected 
host cells to express the transfected nucleic acid molecule, and; (iv) introducing an amount of 
said host cells or an extract thereof to the subject effective to increase an immune response 
against the cells of the subject associated with the condition. Preferably, the antigen is a 
human antigen and the subject is a human. In certain preferred embodiments the nucleic acid 
molecule is a kinectin nucleic acid molecule. 

In one embodiment the method also includes the step of (a) identifying an MHC 
molecule which presents a portion of an expression product of the nucleic acid molecule, 
wherein the host cell expresses the same MHC molecule as identified in (a) and wherein the 
host cell presents an MHC binding portion of the expression product of the nucleic acid 
molecule. 

In another embodiment the method also includes the step of treating the host cells to 
render them non-proliferative. 

In yet another embodiment the immune response comprises a B-cell response or a T 
cell response. Preferably the response is a T-cell response which comprises generation of 
cytolytic T-cells specific for the host cells presenting the portion of the expression product of 
the nucleic acid molecule or cells of the subject expressing the human cancer associated 
antigen. 

In another embodiment the nucleic acid molecule is a NA Group 3 molecule. 
Another aspect of the invention is a method for treating or diagnosing or monitoring a 
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subject having a condition characterized by expression of an abnormal amount of a protein 
encoded by a nucleic acid molecule that is aNA Group 1 molecule. The method includes the 
step of administering to the subject an antibody which specifically binds to the protein or a 
peptide derived therefrom, the antibody being coupled to a therapeutically useful agent, in an 
amount effective to treat the condition. 

In one embodiment the antibody is a monoclonal antibody. Preferably the monoclonal 
antibody is a chimeric antibody or a humanized antibody. 

In another aspect the invention is a method for treating a condition characterized by 
expression in a subject of abnormal amounts of a protein encoded by a nucleic acid molecule 
that is a NA Group 1 nucleic acid molecule. The method involves the step of administering to 
a subject at least one of the pharmaceutical compositions of the invention described above in 
an amount effective to prevent, delay the onset of, or inhibit the condition in the subject. In 
one embodiment the condition is cancer. In another embodiment the method includes the step 
of first identifying that the subject expresses in a tissue abnormal amounts of the protein. 

The invention in another aspect is a method for treating a subject having a condition 
characterized by expression of abnormal amounts of a protein encoded by a nucleic acid 
molecule that is a NA Group 1 nucleic acid molecule. The method includes the steps of (i) 
identifying cells from the subject which express abnormal amounts of the protein; (ii) 
isolating a sample of the cells; (iii) cultivating the cells, and (iv) introducing the cells to the 
subject in an amount effective to provoke an immune response against the cells. 

In one embodiment the method includes the step of rendering the cells non- 
proliferative, prior to introducing them to the subject. 

In another aspect the invention is a method for treating a pathological cell condition 
characterized by abnormal expression of a protein encoded by a nucleic acid molecule that is a 
NA Group 1 nucleic acid molecule. The method includes the step of administering to a 
subject in need thereof an effective amount of an agent which inhibits the expression or 
activity of the protein. 

In one embodiment the agent is an inhibiting antibody which selectively binds to the 
protein and wherein the antibody is a monoclonal antibody, a chimeric antibody, a humanized 
antibody or a fragment thereof. In another embodiment the agent is an antisense nucleic acid 
molecule which selectively binds to the nucleic acid molecule which encodes the protein. In 
yet another important embodiment the nucleic acid molecule is aNA Group 3 nucleic acid 
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molecule. In other preferred embodiments, the nucleic acid molecule is a kinectin nucleic 
acid molecule. 

The invention includes in another aspect a composition of matter useful in stimulating 
an immune response to a plurality of proteins encoded by nucleic acid molecules that are NA 
Group 1 molecules. The composition is a plurality of peptides derived from the amino acid 
sequences of the proteins, wherein the peptides bind to one or more MHC molecules 
presented on the surface of the cells which express an abnormal amount of the protein. In 
preferred embodiments, at least one of the proteins is kinectin. 

In one embodiment at least a portion of the plurality of peptides bind to MHC 
molecules and elicit a cytolytic response thereto. In another embodiment the composition of 
matter includes an adjuvant. In another embodiment the adjuvant is a saponin, GM-CSF, or 
an interleukin. In still another embodiment, the compositions also includes at least one 
peptide useful in stimulating an immune response to at least one protein which is not encoded 
by nucleic acid molecules that are NA Group 1 molecules, wherein the at least one peptide 
binds to one or more MHC molecules. 

According to another aspect the invention is an isolated antibody which selectively 
binds to a complex of: (i) a peptide derived from a protein encoded by a nucleic acid molecule 
that is a NA Group 1 molecule and (ii) and an MHC molecule to which binds the peptide to 
form the complex, wherein the isolated antibody does not bind to (i) or (ii) alone. 

In one embodiment the antibody is a monoclonal antibody, a chimeric antibody, a 
humanized antibody or a fragment thereof. 

The invention also involves the use of the genes, gene products, fragments thereof, 
agents which bind thereto, and so on in the preparation of medicaments. A particular 
medicament is for treating cancer and a more particular medicament is for treating breast 
cancer, lung cancer, renal cancer, colon cancer, prostate cancer or gastric cancer. 

Detailed Description of the Invention 

In the above summary and in the ensuing description, lists of sequences are provided. 
The lists are meant to embrace each single sequence separately, two or more sequences 
together where they form a part of the same gene, any combination of two or more sequences 
which relate to different genes, including and up to the total number on the list, as if each and 
every combination were separately and specifically enumerated. Likewise, when mentioning 
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fragment size, it is intended that a range embrace the smallest fragment mentioned to the full- 
length of the sequence (less one nucleotide or amino acid so that it is a fragment), each and 
every fragment length intended as if specifically enumerated. Thus, if a fragment could be 
between 10 and 15 in length, it is explicitly meant to mean 10, 1 1, 12, 13, 14, or 15 in length. 

The summary and the claims mention antigen precursors and antigens. As used in the 
summary and in the claims, a precursor is substantially the full-length protein encoded by the 
coding region of the isolated DNA and the antigen is a peptide which complexes with MHC, 
preferably HLA, and which participates in the immune response as part of that complex. Such 
antigens are typically 9 amino acids long, although this may vary slightly. 

As used herein, a subject is a human, non-human primate, cow, horse, pig, sheep, goat, 
dog, cat or rodent. In all embodiments human cancer antigens and human subjects are 
preferred. 

The present invention in one aspect involves the cloning of cDNAs encoding human 
cancer associated antigen precursors using autologous antisera of subjects having renal cancer. 
The sequences of the clones representing genes identified according to the methods described 
herein are presented in the attached Sequence Listing. Of the foregoing, it can be seen that 
some of the clones are considered completely novel as no nucleotide or amino acid 
homologies to coding regions were found in the databases searched. Other clones are novel 
but have some homology to sequences deposited in databases (mainly EST sequences). 
Nevertheless, the entire gene sequence was not previously known. In some cases no function 
was suspected and in other cases, even if a function was suspected, it was not know that the 
gene was associated with cancer. In all cases, it was not known or suspected that the gene 
encoded a cancer antigen which reacted with antibody from autologous sera. Analysis of the 
clone sequences by comparison to nucleic acid and protein databases determined that still 
other of the clones surprisingly are closely related to other previously-cloned genes. The 
sequences of these related genes is also presented in the Sequence Listing. The nature of the 
foregoing genes as encoding antigens recognized by the immune systems of cancer patients is, 
of course, unexpected. 

The invention thus involves in one aspect cancer associated antigen polypeptides, 
genes encoding those polypeptides, functional modifications and variants of the foregoing, 
useful fragments of the foregoing, as well as diagnostics and therapeutics relating thereto. 

Homologs and alleles of the cancer associated antigen nucleic acids of the invention 
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can be identified by conventional techniques. Thus, an aspect of the invention is those nucleic 
acid sequences which code for cancer associated antigen precursors. Because this application 
contains so many sequences, the following chart is provided to identify the various groups of 
sequences discussed in the claims and in the summary: 



to a 



or 



Nucleic Acid Sequences 

NA Group 1 . (a) nucleic acid molecules which hybridize under stringent conditions 
molecule consisting of a nucleic acid sequence selected from the group consisting of nucleic 
acid sequences among SEQ ID NOs: 1-1 1 and 22-46 and which code for a cancer associated 
antigen precursor, 

(b) deletions, additions and substitutions which code for a respective cancer 
associated antigen precursor, 

(c) nucleic acid molecules that differ from the nucleic acid molecules of (a) 
(b) in codon sequence due to the degeneracy of the genetic code, and 

(d) complements of (a), (b) or (c). 

NA Group 2. Fragments of NA Group 1, which codes for a polypeptide which, or a portion 
of which, binds an MHC molecule to form a complex recognized by a an autologous antibody 
or lymphocyte. 

NA Group 3. The subset of NA Group 1 where the nucleotide sequence is selected from the 
group consisting of: 

(a) previously unknown human nucleic acids coding for a human cancer 
associated antigen precursor (i.e. nucleic acid sequences among SEQ ID NOs: 1-11), 

(b) deletions, additions and substitutions which code for a respective human 
cancer associated antigen precursor, 

(c) nucleic acid molecules that differ from the nucleic acid molecules of (a) 
(b) in codon sequence due to the degeneracy of the genetic code, and 

(d) complements of (a), (b) or (c). 



or 
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NA Group 4. Fragments of NA Group 3, which code for a polypeptide which, or a portion of 
which, binds to an MHC molecule to form a complex recognized by an autologous antibody 
or lymphocyte. 

NA Group 5. A subset of NA Group 1, comprising human cancer associated antigens that 
react with allogeneic cancer antisera. 

Polypeptide Sequences 
PP Group 1 . Polypeptides encoded by NA Group 1 . 
PP Group 2. Polypeptides encoded by NA Group 2 
PP Group 3 . Polypeptides encoded by NA Group 3 . 
PP Group 4. Polypeptides encoded by NA Group 4. 
PP Group 5 . Polypeptides encoded by NA Group 5. 

The term "stringent conditions" as used herein refers to parameters with which the art 
is familiar. Nucleic acid hybridization parameters may be found in references which compile 
such methods, e.g. Molecular Cloning: A Laboratory Manual, J. Sambrook, et al., eds., 
Second Edition, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, New York, 1989, 
or Current Protocols in Molecular Biology, F.M. Ausubel, et al., eds., John Wiley & Sons, 
Inc., New York. More specifically, stringent conditions, as used herein, refers, for example, 
to hybridization at 65°C in hybridization buffer (3.5 x SSC, 0.02% Ficoll, 0.02% polyvinyl 
pyrrolidone, 0.02% Bovine Serum Albumin, 2.5mM NaH 2 P0 4 (pH7), 0.5% SDS, 2mM 
EDTA). SSC is 0.15M sodium chloride/0. 15M sodium citrate, pH7; SDS is sodium dodecyl 
sulphate; and EDTA is ethylenediaminetetracetic acid. After hybridization, the membrane 
upon which the DNA is transferred is washed, for example, in 2 x SSC at room temperature 
and then at 0.1 - 0.5 x SSC/0.1 x SDS at temperatures up to 68°C. 

There are other conditions, reagents, and so forth which can be used, which result in a 
similar degree of stringency. The skilled artisan will be familiar with such conditions, and 
thus they are not given here. It will be understood, however, that the skilled artisan will be 
able to manipulate the conditions in a manner to permit the clear identification of homologs 
and alleles of cancer associated antigen nucleic acids of the invention (e.g., by using lower 
stringency conditions). The skilled artisan also is familiar with the methodology for screening 
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cells and libraries for expression of such molecules which then are routinely isolated, followed 
by isolation of the pertinent nucleic acid molecule and sequencing. 

In general homologs and alleles typically will share at least 75% nucleotide identity 
and/or at least 90% amino acid identity to the sequences of cancer associated antigen nucleic 
acid and polypeptides, respectively, in some instances will share at least 90% nucleotide 
identity and/or at least 95% amino acid identity and in still other instances will share at least 
95% nucleotide identity and/or at least 99% amino acid identity. The homology can be 
calculated using various, publicly available software tools developed by NCBI (Bethesda, 
Maryland) that can be obtained through the internet (ftp:/ncbi.nlm.nih.gov/pub/). Exemplary 
tools include the BLAST system available at httn://www.ncbi.nlm.nih.oov . using default 
settings. Pairwise and ClustalW alignments (BLOSUM30 matrix setting) as well as Kyte- 
Doolittle hydropathic analysis can be obtained using the Mac Vector sequence analysis 
software (Oxford Molecular Group). Watson-Crick complements of the foregoing nucleic 
acids also are embraced by the invention. 

In screening for cancer associated antigen genes, a Southern blot may be performed 
using the foregoing conditions, together with a radioactive probe. After washing the 
membrane to which the DNA is finally transferred, the membrane can be placed against X-ray 
film to detect the radioactive signal. In screening for the expression of cancer associated 
antigen nucleic acids, Northern blot hybridizations using the foregoing conditions (see also 
the Examples) can be performed on samples taken from breast cancer patients or subjects 
suspected of having a condition characterized by expression of breast cancer associated 
antigen genes. Amplification protocols such as polymerase chain reaction using primers 
which hybridize to the sequences presented also can be used for detection of the cancer 
associated antigen genes or expression thereof 

The renal cancer associated genes correspond to SEQ ID NOs. 1-11 and 22-35. 
Kinectin cancer associated sequences correspond to SEQ ID Nos:36-46. The preferred cancer 
associated antigens for the methods of diagnosis disclosed herein are those which were found 
to react with allogeneic cancer antisera (i.e. NA Group 5). Encoded polypeptides (e.g., 
proteins), peptides and antisera thereto are also preferred for diagnosis. 

The invention also includes degenerate nucleic acids which include alternative codons 
to those present in the native materials. For example, serine residues are encoded by the 
codons TCA, AGT, TCC, TCG, TCT and AGC. Each of the six codons is equivalent for the 
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purposes of encoding a serine residue. Thus, it will be apparent to one of ordinary skill in the 
art that any of the serine-encoding nucleotide triplets may be employed to direct the protein 
synthesis apparatus, in vitro or in vivo, to incorporate a serine residue into an elongating breast 
cancer associated antigen polypeptide. Similarly, nucleotide sequence triplets which encode 
other amino acid residues include, but are not limited to: CCA, CCC, CCG and CCT (proline 
codons); CGA, CGC, CGG, CGT, AGA and AGG (arginine codons); ACA, ACC, ACG and 
ACT (threonine codons); AAC and AAT (asparagine codons); and ATA, ATC and ATT 
(isoleucine codons). Other amino acid residues may be encoded similarly by multiple 
nucleotide sequences. Thus, the invention embraces degenerate nucleic acids that differ from 
the biologically isolated nucleic acids in codon sequence due to the degeneracy of the genetic 
code. 

The invention also provides modified nucleic acid molecules which include additions, 
substitutions and deletions of one or more nucleotides. In preferred embodiments, these 
modified nucleic acid molecules and/or the polypeptides they encode retain at least one 
activity or function of the unmodified nucleic acid molecule and/or the polypeptides, such as 
antigenicity, enzymatic activity, receptor binding, formation of complexes by binding of 
peptides by MHC class I and class II molecules, etc. In certain embodiments, the modified 
nucleic acid molecules encode modified polypeptides, preferably polypeptides having 
conservative amino acid substitutions as are described elsewhere herein. The modified 
nucleic acid molecules are structurally related to the unmodified nucleic acid molecules and in 
preferred embodiments are sufficiently structurally related to the unmodified nucleic acid 
molecules so that the modified and unmodified nucleic acid molecules hybridize under 
stringent conditions known to one of skill in the art. 

For example, modified nucleic acid molecules which encode polypeptides having 
single amino acid changes can be prepared. Each of these nucleic acid molecules can have 
one, two or three nucleotide substitutions exclusive of nucleotide changes corresponding to 
the degeneracy of the genetic code as described herein. Likewise, modified nucleic acid 
molecules which encode polypeptides having two amino acid changes can be prepared which 
have, e.g., 2-6 nucleotide changes. Numerous modified nucleic acid molecules like these will 
be readily envisioned by one of skill in the art, including for example, substitutions of 
nucleotides in codons encoding amino acids 2 and 3, 2 and 4, 2 and 5, 2 and 6, and so on. In 
the foregoing example, each combination of two amino acids is included in the set of 
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modified nucleic acid molecules, as well as all nucleotide substitutions which code for the 
amino acid substitutions. Additional nucleic acid molecules that encode polypeptides having 
additional substitutions (i.e., 3 or more), additions or deletions (e.g., by introduction of a stop 
codon or a splice site(s)) also can be prepared and are embraced by the invention as readily 
envisioned by one of ordinary skill in the art. Any of the foregoing nucleic acids or 
polypeptides can be tested by routine experimentation for retention of structural relation or 
activity to the nucleic acids and/or polypeptides disclosed herein. 

The invention also provides isolated unique fragments of cancer associated antigen 
nucleic acid sequences or complements thereof. A unique fragment is one that is a 'signature' 
for the larger nucleic acid. It, for example, is long enough to assure that its precise sequence 
is not found in molecules within the human genome outside of the cancer associated antigen 
nucleic acids defined above (and human alleles). Those of ordinary skill in the art may apply 
no more than routine procedures to determine if a fragment is unique within the human 
genome. Unique fragments, however, exclude fragments completely composed of the 
nucleotide sequences of any of GenBank accession numbers listed in Table 1 or other 
previously published sequences as of the filing date of the priority documents for sequences 
listed in a respective priority document or the filing date of this application for sequences 
listed for the first time in this application which overlap the sequences of the invention. 

A fragment which is completely composed of the sequence described in the foregoing 
GenBank deposits is one which does not include any of the nucleotides unique to the 
sequences of the invention. Thus, a unique fragment must contain a nucleotide sequence other 
than the exact sequence of those in GenBank or fragments thereof. The difference may be an 
addition, deletion or substitution with respect to the GenBank sequence or it may be a 
sequence wholly separate from the GenBank sequence. 

Unique fragments can be used as probes in Southern and Northern blot assays to 
identify such nucleic acids, or can be used in amplification assays such as those employing 
PCR. As known to those skilled in the art, large probes such as 200, 250, 300 or more 
nucleotides are preferred for certain uses such as Southern and Northern blots, while smaller 
fragments will be preferred for uses such as PCR. Unique fragments also can be used to 
produce fusion proteins for generating antibodies or determining binding of the polypeptide 
fragments, or for generating immunoassay components. Likewise, unique fragments can be 
employed to produce nonfiised fragments of the cancer associated antigen polypeptides. 
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useful, for example, in the preparation of antibodies, and in immunoassays. Unique fragments 
further can be used as antisense molecules to inhibit the expression of cancer associated 
antigen nucleic acids and polypeptides, particularly for therapeutic purposes as described in 
greater detail below. 

As will be recognized by those skilled in the art, the size of the unique fragment will 
depend upon its conservancy in the genetic code. Thus, some regions of cancer associated 
antigen sequences and complements thereof will require longer segments to be unique while 
others will require only short segments, typically between 12 and 32 nucleotides (e.g. 12, 13, 
14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31 and 32 or more bases long, 
up to the entire length of the disclosed sequence. As mentioned above, this disclosure intends 
to embrace each and every fragment of each sequence, beginning at the first nucleotide, the 
second nucleotide and so on, up to 8 nucleotides short of the end, and ending anywhere from 
nucleotide number 8, 9, 10 and so on for each sequence, up to the very last nucleotide 
(provided the sequence is unique as described above). 

Virtually any segment of the polypeptide coding region of novel cancer associated 
antigen nucleic acids, or complements thereof, that is 18 or more nucleotides in length will be 
unique. Those skilled in the art are well versed in methods for selecting such sequences, 
typically on the basis of the ability of the unique fragment to selectively distinguish the 
sequence of interest from other sequences in the human genome of the fragment to those on 
known databases typically is all that is necessary, although in vitro confirmatory hybridization 
and sequencing analysis may be performed. 

Especially preferred include nucleic acids encoding a series of epitopes, known as 
"polytopes". The epitopes can be arranged in sequential or overlapping fashion {see, e.g., 
Thomson et al., Proc. Natl Acad. ScL USA 92:5845-5849, 1995; Gilbert et al., Nature 
Biotechnol. 15:1280-1284, 1997), with or without the natural flanking sequences, and can be 
separated by unrelated linker sequences if desired. The polytope is processed to generated 
individual epitopes which are recognized by the immune system for generation of immune 
responses. 

Thus, for example, peptides derived from a polypeptide having an amino acid 
sequence encoded by one of the nucleic acid disclosed herein, and which are presented by 
MHC molecules and recognized by CTL or T helper lymphocytes, can be combined with 
peptides from one or more other cancer associated antigens (e.g. by preparation of hybrid 
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nucleic acids or polypeptides) to form "polytopes". The two or more peptides (or nucleic 
acids encoding the peptides) can be selected from those described herein, or they can include 
one or more peptides of previously known cancer associated antigens. Exemplary cancer 
associated peptide antigens that can be administered to induce or enhance an immune 
response are derived from tumor associated genes and encoded proteins including MAGE-A1, 
MAGE-A2, MAGE- A3, MAGE-A4, MAGE-A5, MAGE-A6, MAGE-A7, MAGE-A8, 
MAGE-A9, MAGE- A 10, MAGE-A1 1, MAGE-A12, MAGE- 13, GAGE- 1 , GAGE-2, GAGE- 
3, GAGE-4, GAGE-5, GAGE-6, GAGE-7, GAGE-8, B AGE- 1 , RAGE- 1 , LB33/MUM-1, 
PRAME, NAG, MAGE-B2, MAGE-B3, MAGE-B4, tyrosinase, brain glycogen 
phosphorylase, Melan-A, MAGE-C1 MAGE-C2, NY-ESO-1, LAGE-1, SSX-1, SSX-2 
(HOM-MEL-40) SSX-4, SSX-5, SCP-1 and CT-7. See. for example, PCT application 
publication no. WO96/10577. Other examples will be known to one of ordinary skill in the 
art (for example, see Coulie, Stem Cells 13:393-403, 1995), and can be used in the invention 
in a like manner as those disclosed herein. One of ordinary skill in the art can prepare 
polypeptides comprising one or more peptides and one or more of the foregoing cancer 
associated peptides, or nucleic acids encoding such polypeptides, according to standard 
procedures of molecular biology. 

Thus polytopes are groups of two or more potentially immunogenic or immune 
response stimulating peptides which can be joined together in various arrangements (e.g. 
concatenated, overlapping). The polytope (or nucleic acid encoding the polytope) can be 
administered in a standard immunization protocol, e.g. to animals, to test the effectiveness of 
the polytope in stimulating, enhancing and/or provoking an immune response. 

The peptides can be joined together directly or via the use of flanking sequences to 
form polytopes, and the use of polytopes as vaccines is well known in the art (see, e.g., 
Thomson et al., Proc. Acad. Natl. Acad. Sci USA 92(13):5845-5849, 1995; Gilbert et al., 
Nature Biotechnol. 15(12):1280-1284, 1997; Thomson et al., J. Immunol. 157(2):822-826, 
1 996; Tarn et al., J. Exp. Med. 1 71(l):299-306, 1 990). For example, Tarn showed that 
polytopes consisting of both MHC class I and class II binding epitopes successfully generated 
antibody and protective immunity in a mouse model. Tarn also demonstrated that polytopes 
comprising "strings" of epitopes are processed to yield individual epitopes which are 
presented by MHC molecules and recognized by CTLs. Thus polytopes containing various 
numbers and combinations of epitopes can be prepared and tested for recognition by CTLs 
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and for efficacy in increasing an immune response. 

It is known that tumors express a set of tumor antigens, of which only certain subsets 
may be expressed in the tumor of any given patient. Polytopes can be prepared which 
correspond to the different combination of epitopes representing the subset of tumor rejection 
antigens expressed in a particular patient. Polytopes also can be prepared to reflect a broader 
spectrum of tumor rejection antigens known to be expressed by a tumor type. Polytopes can 
be introduced to a patient in need of such treatment as polypeptide structures, or via the use of 
nucleic acid delivery systems known in the art (see, e.g., Allsopp et al., Eur. ./. Immunol. 
26(8): 195 1-1 959, 1996). Adenovirus, pox virus, Ty-virus like particles, adeno-associated 
virus, plasmids, bacteria, etc. can be used in such delivery. One can test the polytope delivery 
systems in mouse models to determine efficacy of the delivery system. The systems also can 
be tested in human clinical trials. 

In instances in which a human HLA class I molecule presents tumor rejection antigens 
derived from cancer associated nucleic acids, the expression vector may also include a nucleic 
acid sequence coding for the HLA molecule that presents any particular tumor rejection 
antigen derived from these nucleic acids and polypeptides. Alternatively, the nucleic acid 
sequence coding for such a HLA molecule can be contained within a separate expression 
vector. In a situation where the vector contains both coding sequences, the single vector can 
be used to transfect a cell which does not normally express either one. Where the coding 
sequences for a cancer associated antigen precursor and the HLA molecule which presents it 
are contained on separate expression vectors, the expression vectors can be cotransfected. The 
cancer associated antigen precursor coding sequence may be used alone, when, e.g. the host 
cell already expresses a HLA molecule which presents a cancer associated antigen derived 
from precursor molecules. Of course, there is no limit on the particular host cell which can be 
used. As the vectors which contain the two coding sequences may be used in any antigen- 
presenting cells if desired, and the gene for cancer associated antigen precursor can be used in 
host cells which do not express a HLA molecule which presents a cancer associated antigen. 
Further, cell-free transcription systems may be used in lieu of cells. 

As mentioned above, the invention embraces antisense oligonucleotides that 
selectively bind to a nucleic acid molecule encoding a cancer associated antigen polypeptide, 
to reduce the expression of cancer associated antigens. This is desirable in virtually any 
medical condition wherein a reduction of expression of cancer associated antigens is 
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desirable, e.g., in the treatment of cancer. This is also useful for in vitro or in vivo testing of 
the effects of a reduction of expression of one or more cancer associated antigens. 

As used herein, the term "antisense oligonucleotide" or "antisense" describes an 
oligonucleotide that is an oligoribonucleotide, oligodeoxyribonucleotide, modified 
oligoribonucleotide, or modified oligodeoxyribonucleotide which hybridizes under 
physiological conditions to DNA comprising a particular gene or to an mRNA transcript of 
that gene and, thereby, inhibits the transcription of that gene and/or the translation of that 
mRNA. The antisense molecules are designed so as to interfere with transcription or 
translation of a target gene upon hybridization with the target gene or transcript. Those 
skilled in the art will recognize that the exact length of the antisense oligonucleotide and its 
degree of complementarity with its target will depend upon the specific target selected, 
including the sequence of the target and the particular bases which comprise that sequence. It 
is preferred that the antisense oligonucleotide be constructed and arranged so as to bind 
selectively with the target under physiological conditions, i.e., to hybridize substantially more 
to the target sequence than to any other sequence in the target cell under physiological 
conditions. Based upon the sequences of nucleic acids encoding breast cancer associated 
antigen, or upon allelic or homologous genomic and/or cDNA sequences, one of skill in the 
art can easily choose and synthesize any of a number of appropriate antisense molecules for 
use in accordance with the present invention. In order to be sufficiently selective and potent 
for inhibition, such antisense oligonucleotides should comprise at least 10 and, more 
preferably, at least 15 consecutive bases which are complementary to the target, although in 
certain cases modified oligonucleotides as short as 7 bases in length have been used 
successfully as antisense oligonucleotides (Wagner et al., Nature Biotechnol. 14:840-844, 
1996). Most preferably, the antisense oligonucleotides comprise a complementary sequence 
of 20-30 bases. Although oligonucleotides may be chosen which are antisense to any region 
of the gene or mRNA transcripts, in preferred embodiments the antisense oligonucleotides 
correspond to N-terminal or 5' upstream sites such as translation initiation, transcription 
initiation or promoter sites. In addition, 3 '-untranslated regions may be targeted. Targeting to 
mRNA splicing sites has also been used in the art but may be less preferred if alternative 
mRNA splicing occurs. In addition, the antisense is targeted, preferably, to sites in which 
mRNA secondary structure is not expected (see, e.g., Sainio et al., Cell Mol. Neurobiol. 
14(5):439-457, 1994) and at which proteins are not expected to bind. Finally, although the 
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listed sequences are cDNA sequences, one of ordinary skill in the art may easily derive the 
genomic DNA corresponding to the cDNA of a cancer associated antigen. Thus, the present 
invention also provides for antisense oligonucleotides which are complementary to the 
genomic DNA corresponding to nucleic acids encoding cancer associated antigens. Similarly, 
antisense to allelic or homologous cDNAs and genomic DNAs are enabled without undue 
experimentation. 

In one set of embodiments, the antisense oligonucleotides of the invention may be 
composed of "natural" deoxyribonucleotides, ribonucleotides, or any combination thereof. 
That is, the 5' end of one native nucleotide and the 3' end of another native nucleotide may be 
covalently linked, as in natural systems, via a phosphodiester intemucleoside linkage. These 
oligonucleotides may be prepared by art recognized methods which may be carried out 
manually or by an automated synthesizer. They also may be produced recombinantly by 
vectors. 

In preferred embodiments, however, the antisense oligonucleotides of the invention 
also may include "modified" oligonucleotides. That is, the oligonucleotides may be modified 
in a number of ways which do not prevent them from hybridizing to their target but which 
enhance their stability or targeting or which otherwise enhance their therapeutic effectiveness. 

The term "modified oligonucleotide" as used herein describes an oligonucleotide in 
which (1) at least two of its nucleotides are covalently linked via a synthetic intemucleoside 
linkage (i.e., a linkage other than a phosphodiester linkage between the 5' end of one 
nucleotide and the 3' end of another nucleotide) and/or (2) a chemical group not normally 
associated with nucleic acids has been covalently attached to the oligonucleotide. Preferred 
synthetic intemucleoside linkages are phosphorothioates, alkylphosphonates, 
phosphorodithioates, phosphate esters, alkylphosphonothioates, phosphoramidates, 
carbamates, carbonates, phosphate triesters, acetamidates, carboxymethyl esters and peptides. 

The term "modified oligonucleotide" also encompasses oligonucleotides with a 
covalently modified base and/or sugar. For example, modified oligonucleotides include 
oligonucleotides having backbone sugars which are covalently attached to low molecular 
weight organic groups other than a hydroxyl group at the 3' position and other than a 
phosphate group at the 5' position. Thus modified oligonucleotides may include a 2'-0- 
alkylated ribose group. In addition, modified oligonucleotides may include sugars such as 
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arabinose instead of ribose. The present invention, thus, contemplates pharmaceutical 
preparations containing modified antisense molecules that are complementary to and 
hybridizable with, under physiological conditions, nucleic acids encoding breast cancer 
associated antigen polypeptides, together with pharmaceutical ly acceptable carriers. 

Antisense oligonucleotides may be administered as part of a pharmaceutical 
composition. Such a pharmaceutical composition may include the antisense oligonucleotides 
in combination with any standard physiologically and/or pharmaceutical^ acceptable carriers 
which are known in the art. The compositions should be sterile and contain a therapeutically 
effective amount of the antisense oligonucleotides in a unit of weight or volume suitable for 
administration to a patient. The term "pharmaceutically acceptable" means a non-toxic 
material that does not interfere with the effectiveness of the biological activity of the active 
ingredients. The term "physiologically acceptable" refers to a non-toxic material that is 
compatible with a biological system such as a cell, cell culture, tissue, or organism. The 
characteristics of the carrier will depend on the route of administration. Physiologically and 
pharmaceutically acceptable carriers include diluents, fillers, salts, buffers, stabilizers, 
solubilizers, and other materials which are well known in the art, as further described below. 

As used herein, a "vector" may be any of a number of nucleic acids into which a 
desired sequence may be inserted by restriction and ligation for transport between different 
genetic environments or for expression in a host cell. Vectors are typically composed of DNA 
although RNA vectors are also available. Vectors include, but are not limited to, plasmids, 
phagemids and virus genomes. A cloning vector is one which is able to replicate 
autonomously or integrated in the genone in a host cell, and which is further characterized by 
one or more endonuclease restriction sites at which the vector may be cut in a determinable 
fashion and into which a desired DNA sequence may be ligated such that the new 
recombinant vector retains its ability to replicate in the host cell. In the case of plasmids, 
replication of the desired sequence may occur many times as the plasmid increases in copy 
number within the host bacterium or just a single time per host before the host reproduces by 
mitosis. In the case of phage, replication may occur actively during a lytic phase or passively 
during a lysogenic phase. An expression vector is one into which a desired DNA sequence 
may be inserted by restriction and ligation such that it is operably joined to regulatory 
sequences and may be expressed as an RNA transcript. Vectors may further contain one or 
more marker sequences suitable for use in the identification of cells which have or have not 
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been transformed or transfected with the vector. Markers include, for example, genes 
encoding proteins which increase or decrease either resistance or sensitivity to antibiotics or 
other compounds, genes which encode enzymes whose activities are detectable by standard 
assays known in the art (e.g., 13-galactosidase, luciferase or alkaline phosphatase), and genes 
which visibly affect the phenotype of transformed or transfected cells, hosts, colonies or 
plaques (e.g., green fluorescent protein). Preferred vectors are those capable of autonomous 
replication and expression of the structural gene products present in the DNA segments to 
which they are operably joined. 

As used herein, a coding sequence and regulatory sequences are said to be "operably" 
joined when they are covalently linked in such a way as to place the expression or 
transcription of the coding sequence under the influence or control of the regulatory 
sequences. If it is desired that the coding sequences be translated into a functional protein, 
two DNA sequences are said to be operably joined if induction of a promoter in the 5' 
regulatory sequences results in the transcription of the coding sequence and if the nature of the 
linkage between the two DNA sequences does not (1) result in the introduction of a frame- 
shift mutation, (2) interfere with the ability of the promoter region to direct the transcription 
of the coding sequences, or (3) interfere with the ability of the corresponding RNA transcript 
to be translated into a protein. Thus, a promoter region would be operably joined to a coding 
sequence if the promoter region were capable of effecting transcription of that DNA sequence 
such that the resulting transcript might be translated into the desired protein or polypeptide. 

The precise nature of the regulatory sequences needed for gene expression may vary 
between species or cell types, but shall in general include, as necessary, 5' non-transcribed 
and 5' non-translated sequences involved with the initiation of transcription and translation 
respectively, such as a TATA box, capping sequence, CAAT sequence, and the like. 
Especially, such 5' non-transcribed regulatory sequences will include a promoter region 
which includes a promoter sequence for transcriptional control of the operably joined gene. 
Regulatory sequences may also include enhancer sequences or upstream activator sequences 
as desired. The vectors of the invention may optionally include 5' leader or signal sequences. 
The choice and design of an appropriate vector is within the ability and discretion of one of 
ordinary skill in the art. 

Expression vectors containing all the necessary elements for expression are 
commercially available and known to those skilled in the art. See, e.g., Sambrook et al., 
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Molecular Cloning: A Laboratory Manual, Second Edition, Cold Spring Harbor Laboratory 
Press, 1989. Cells are genetically engineered by the introduction into the cells of 
heterologous DNA (RNA) encoding a breast cancer associated antigen polypeptide or 
fragment or variant thereof. That heterologous DNA (RNA) is placed under operable control 
of transcriptional elements to permit the expression of the heterologous DNA in the host cell. 

Preferred systems for mRNA expression in mammalian cells are those such as 
pRc/CMV (available from Invitrogen, Carlsbad, CA) that contain a selectable marker such as 
a gene that confers G4 18 resistance (which facilitates the selection of stably transfected cell 
lines) and the human cytomegalovirus (CMV) enhancer-promoter sequences. Additionally, 
suitable for expression in primate or canine cell lines is the pCEP4 vector (Invitrogen), which 
contains an Epstein Barr Virus (EBV) origin of replication, facilitating the maintenance of 
plasmid as a multicopy extrachromosomal element. Another expression vector is the pEF- 
BOS plasmid containing the promoter of polypeptide Elongation Factor la, which stimulates 
efficiently transcription in vitro. The plasmid is described by Mishizuma and Nagata (Nut: 
Acids Res. 18:5322, 1990), and its use in transfection experiments is disclosed by, for 
example, Demoulin (Mo/. Cell. Biol. 16:4710-4716, 1996). Still another preferred expression 
vector is an adenovirus, described by Stratford-Perricaudet, which is defective for El and E3 
proteins (J. Clin. Invest. 90:626-630, 1992). The use of the adenovirus as an Adeno.PlA 
recombinant for the expression of an antigen is disclosed by Warnier et al., in intradermal 
injection in mice for immunization against PI A (Int. J. Cancer, 67:303-310, 1996). 
Additional vectors for delivery of nucleic acid are provided below. 

The invention also embraces so-called expression kits, which allow the artisan to 
prepare a desired expression vector or vectors. Such expression kits include at least separate 
portions of a vector and one or more of the previously discussed cancer associated antigen 
nucleic acid molecules. Other components may be added, as desired, as long as the previously 
mentioned nucleic acid molecules, which are required, are included. The invention also 
includes kits for amplification of a cancer associated antigen nucleic acid, including at least 
one pair of amplification primers which hybridize to a cancer associated antigen nucleic acid. 
The primers preferably are 12-32 nucleotides in length and are non-overlapping to prevent 
formation of "primer-dimers". One of the primers will hybridize to one strand of the cancer 
associated antigen nucleic acid and the second primer will hybridize to the complementary 
strand of the cancer associated antigen nucleic acid, in an arrangement which permits 
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amplification of the cancer associated antigen nucleic acid. Selection of appropriate primer 
pairs is standard in the art. For example, the selection can be made with assistance of a 
computer program designed for such a purpose, optionally followed by testing the primers for 
amplification specificity and efficiency. 

The invention also permits the construction of cancer associated antigen gene "knock- 
outs" in cells and in animals, providing materials for studying certain aspects of cancer and 
immune system responses to cancer. 

The invention also provides isolated polypeptides (including whole proteins and 
partial proteins) encoded by the foregoing cancer associated antigen nucleic acids. Such 
polypeptides are useful, for example, alone or as fusion proteins to generate antibodies, as 
components of an immunoassay or diagnostic assay or as therapeutics. Cancer associated 
antigen polypeptides can be isolated from biological samples including tissue or cell 
homogenates, and can also be expressed recombinantly in a variety of prokaryotic and 
eukaryotic expression systems by constructing an expression vector appropriate to the 
expression system, introducing the expression vector into the expression system, and isolating 
the recombinantly expressed protein. Short polypeptides, including antigenic peptides (such 
as are presented by MHC molecules on the surface of a cell for immune recognition) also can 
be synthesized chemically using well-established methods of peptide synthesis. 

A unique fragment of a cancer associated antigen polypeptide, in general, has the 
features and characteristics of unique fragments as discussed above in connection with nucleic 
acids. As will be recognized by those skilled in the art, the size of the unique fragment will 
depend upon factors such as whether the fragment constitutes a portion of a conserved protein 
domain. Thus, some regions of cancer associated antigens will require longer segments to be 
unique while others will require only short segments, typically between 5 and 12 amino acids 
(e.g. 5, 6, 7, 8, 9, 10, 1 1 or 12 or more amino acids including each integer up to the full 
length). 

Unique fragments of a polypeptide preferably are those fragments which retain a 
distinct functional capability of the polypeptide. Functional capabilities which can be retained 
in a unique fragment of a polypeptide include interaction with antibodies, interaction with 
other polypeptides or fragments thereof, selective binding of nucleic acids or proteins, and 
enzymatic activity. One important activity is the ability to act as a signature for identifying 
the polypeptide. Another is the ability to complex with HLA and to provoke in a human an 
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immune response. Those skilled in the art are well versed in methods for selecting unique 
amino acid sequences, typically on the basis of the ability of the unique fragment to 
selectively distinguish the sequence of interest from non-family members. A comparison of 
the sequence of the fragment to those on known databases typically is all that is necessary. 

The invention embraces variants of the cancer associated antigen polypeptides 
described above. As used herein, a "variant" of a cancer associated antigen polypeptide is a 
polypeptide which contains one or more modifications to the primary amino acid sequence of 
a cancer associated antigen polypeptide. Modifications which create a cancer associated 
antigen variant can be made to a cancer associated antigen polypeptide 1) to reduce or 
eliminate an activity of a cancer associated antigen polypeptide; 2) to enhance a property of a 
cancer associated antigen polypeptide, such as protein stability in an expression system or the 
stability of protein-protein binding; 3) to provide a novel activity or property to a cancer 
associated antigen polypeptide, such as addition of an antigenic epitope or addition of a 
detectable moiety; or 4) to provide equivalent or better binding to an HLA molecule. 
Modifications to a cancer associated antigen polypeptide are typically made to the nucleic 
acid which encodes the cancer associated antigen polypeptide, and can include deletions, point 
mutations, truncations, amino acid substitutions and additions of amino acids or non-amino 
acid moieties. Alternatively, modifications can be made directly to the polypeptide, such as 
by cleavage, addition of a linker molecule, addition of a detectable moiety, such as biotin, 
addition of a fatty acid, and the like. Modifications also embrace fusion proteins comprising 
all or part of the cancer associated antigen amino acid sequence. One of skill in the art will be 
familiar with methods for predicting the effect on protein conformation of a change in protein 
sequence, and can thus "design" a variant cancer associated antigen polypeptide according to 
known methods. One example of such a method is described by Dahiyat and Mayo in Science 
278:82-87, 1997, whereby proteins can be designed de novo. The method can be applied to a 
known protein to vary a only a portion of the polypeptide sequence. By applying the 
computational methods of Dahiyat and Mayo, specific variants of a cancer associated antigen 
polypeptide can be proposed and tested to determine whether the variant retains a desired 
conformation. 

In general, variants include cancer associated antigen polypeptides which are modified 
specifically to alter a feature of the polypeptide unrelated to its desired physiological activity. 
For example, cysteine residues can be substituted or deleted to prevent unwanted disulfide 



BNSDOCID: <WO 0Q20587A2_I_> 



r 



WO 00/20587 PCT/US99/22873 

-29- 

Iinkages. Similarly, certain amino acids can be changed to enhance expression of a breast 
cancer associated antigen polypeptide by eliminating proteolysis by proteases in an expression 
system (e.g., dibasic amino acid residues in yeast expression systems in which KEX2 protease 
activity is present). 

Mutations of a nucleic acid which encode a cancer associated antigen polypeptide 
preferably preserve the amino acid reading frame of the coding sequence, and preferably do 
not create regions in the nucleic acid which are likely to hybridize to form secondary 
structures, such a hairpins or loops, which can be deleterious to expression of the variant 
polypeptide. 

Mutations can be made by selecting an amino acid substitution, or by random 
mutagenesis of a selected site in a nucleic acid which encodes the polypeptide. Variant 
polypeptides are then expressed and tested for one or more activities to determine which 
mutation provides a variant polypeptide with the desired properties. Further mutations can be 
made to variants (or to non-variant cancer associated antigen polypeptides) which are silent as 
to the amino acid sequence of the polypeptide, but which provide preferred codons for 
translation in a particular host. The preferred codons for translation of a nucleic acid in, e.g., 
E. coli, are well known to those of ordinary skill in the art. Still other mutations can be made 
to the noncoding sequences of a cancer associated antigen gene or cDNA clone to enhance 
expression of the polypeptide. The activity of variants of cancer associated antigen 
polypeptides can be tested by cloning the gene encoding the variant cancer associated antigen 
polypeptide into a bacterial or mammalian expression vector, introducing the vector into an 
appropriate host cell, expressing the variant cancer associated antigen polypeptide, and testing 
for a functional capability of the cancer associated antigen polypeptides as disclosed herein. 
For example, the variant cancer associated antigen polypeptide can be tested for reaction with 
autologous or allogeneic sera as disclosed in the Examples. Preparation of other variant 
polypeptides may favor testing of other activities, as will be known to one of ordinary skill in 
the art. 

The skilled artisan will also realize that conservative amino acid substitutions may be 
made in cancer associated antigen polypeptides to provide functionally equivalent variants of 
the foregoing polypeptides, i.e, the variants retain the functional capabilities of the cancer 
associated antigen polypeptides. As used herein, a "conservative amino acid substitution" 
refers to an amino acid substitution which does not alter the relative charge or size 
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characteristics of the protein in which the amino acid substitution is made. Variants can be 
prepared according to methods for altering polypeptide sequence known to one of ordinary 
skill in the art such as are found in references which compile such methods, e.g. Molecular 
Cloning: A Laboratory Manual, J. Sambrook, et al., eds., Second Edition, Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, New York, 1 989, or Current Protocols in Molecular 
Biology, F.M. Ausubel, et al., eds., John Wiley & Sons, Inc., New York. Exemplary 
functionally equivalent variants of the cancer associated antigen polypeptides include 
conservative amino acid substitutions of in the amino acid sequences of proteins disclosed 
herein. Conservative substitutions of amino acids include substitutions made amongst amino 
acids within the following groups: (a) M, I, L, V; (b) F, Y, W; (c) K, R, H; (d) A, G; (e) S, T; 
(f) Q, N; and (g) E, D. 

For example, upon determining that a peptide derived from a cancer associated antigen 
polypeptide is presented by an MHC molecule and recognized by CTLs (e.g., as described in 
the Examples), one can make conservative amino acid substitutions to the amino acid 
sequence of the peptide, particularly at residues which are thought not to be direct contact 
points with the MHC molecule. For example, methods for identifying functional variants of 
HLA class II binding peptides are provided in a published PCT application of Strominger and 
Wucherpfennig (PCT/US96/03 1 82). Peptides bearing one or more amino acid substitutions 
also can be tested for concordance with known HLA/MHC motifs prior to synthesis using, 
e.g. the computer program described by D'Amaro and Drijfhout (D'Amaro et al., Human 
Immunol. 43:13-18, 1995; Drijfhout et al., Human Immunol. 43:1-12, 1995). The substituted 
peptides can then be tested for binding to the MHC molecule and recognition by CTLs when 
bound to MHC. These variants can be tested for improved stability and are useful, inter alia, 
in vaccine compositions. 

Conservative amino-acid substitutions in the amino acid sequence of cancer associated 
antigen polypeptides to produce functionally equivalent variants of cancer associated antigen 
polypeptides typically are made by alteration of a nucleic acid encoding a cancer associated 
antigen polypeptide. Such substitutions can be made by a variety of methods known to one of 
ordinary skill in the art. For example, amino acid substitutions may be made by PCR-directed 
mutation, site-directed mutagenesis according to the method of Kunkel (Kunkel, Proc. Nat. 
Acad. Sci. U.S.A. 82: 488-492, 1985), or by chemical synthesis of a gene encoding a cancer 
associated antigen polypeptide. Where amino acid substitutions are made to a small unique 
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fragment of a cancer associated antigen polypeptide, such as an antigenic epitope recognized 
by autologous or allogeneic sera or cytolytic T lymphocytes, the substitutions can be made by 
directly synthesizing the peptide. The activity of functionally equivalent fragments of cancer 
associated antigen polypeptides can be tested by cloning the gene encoding the altered cancer 
associated antigen polypeptide into a bacterial or mammalian expression vector, introducing 
the vector into an appropriate host cell, expressing the altered cancer associated antigen 
polypeptide, and testing for a functional capability of the cancer associated antigen 
polypeptides as disclosed herein. Peptides which are chemically synthesized can be tested 
directly for function, e.g., for binding to antisera recognizing associated antigens. 

The invention as described herein has a number of uses, some of which are described 
elsewhere herein. First, the invention permits isolation of the cancer associated antigen 
protein molecules. A variety of methodologies well-known to the skilled practitioner can be 
utilized to obtain isolated cancer associated antigen molecules. The polypeptide may be 
purified from cells which naturally produce the polypeptide by chromatographic means or 
immunological recognition. Alternatively, an expression vector may be introduced into cells 
to cause production of the polypeptide. In another method, mRNA transcripts may be 
microinjected or otherwise introduced into cells to cause production of the encoded 
polypeptide. Translation of mRNA in cell-free extracts such as the reticulocyte lysate system 
also may be used to produce polypeptide. Those skilled in the art also can readily follow 
known methods for isolating cancer associated antigen polypeptides. These include, but are 
not limited to, immunochromatography, HPLC, size-exclusion chromatography, ion-exchange 
chromatography and immune-affinity chromatography. 

The isolation and identification of cancer associated antigen genes also makes it 
possible for the artisan to diagnose a disorder characterized by expression of cancer associated 
antigens. These methods involve determining expression of one or more cancer associated 
antigen nucleic acids, and/or encoded cancer associated antigen polypeptides and/or peptides 
derived therefrom. In the former situation, such determinations can be carried out via any 
standard nucleic acid determination assay, including the polymerase chain reaction, or 
assaying with labeled hybridization probes. In the latter situation, such determinations can be 
carried out by screening patient antisera for recognition of the polypeptide. 

The invention also makes it possible isolate proteins which bind to cancer associated 
antigens as disclosed herein, including antibodies and cellular binding partners of the cancer 
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associated antigens. Additional uses are described further herein. 

The invention also provides, in certain embodiments, "dominant negative" 
polypeptides derived from cancer associated antigen polypeptides. A dominant negative 
polypeptide is an inactive variant of a protein, which, by interacting with the cellular 
machinery, displaces an active protein from its interaction with the cellular machinery or 
competes with the active protein, thereby reducing the effect of the active protein. For 
example, a dominant negative receptor which binds a ligand but does not transmit a signal in 
response to binding of the ligand can reduce the biological effect of expression of the ligand. 
Likewise, a dominant negative catalytically-inactive kinase which interacts normally with 
target proteins but does not phosphorylate the target proteins can reduce phosphorylation of 
the target proteins in response to a cellular signal. Similarly, a dominant negative 
transcription factor which binds to a promoter site in the control region of a gene but does not 
increase gene transcription can reduce the effect of a normal transcription factor by occupying 
promoter binding sites without increasing transcription. 

The end result of the expression of a dominant negative polypeptide in a cell is a 
reduction in function of active proteins. One of ordinary skill in the art can assess the 
potential for a dominant negative variant of a protein, and using standard mutagenesis 
techniques to create one or more dominant negative variant polypeptides. For example, given 
the teachings contained herein of renal cancer associated antigens, especially those which are 
similar to known proteins which have known activities, one of ordinary skill in the art can 
modify the sequence of the cancer associated antigens by site-specific mutagenesis, scanning 
mutagenesis, partial gene deletion or truncation, and the like. See, e.g., U.S. Patent No. 
5,580,723 and Sambrook et al., Molecular Cloning: A Laboratory Manual, Second Edition, 
Cold Spring Harbor Laboratory Press, 1989. The skilled artisan then can test the population 
of mutagenized polypeptides for diminution in a selected and/or for retention of such an 
activity. Other similar methods for creating and testing dominant negative variants of a 
protein will be apparent to one of ordinary skill in the art. 

The invention also involves agents such as polypeptides which bind to cancer 
associated antigen polypeptides. Such binding agents can be used, for example, in screening 
assays to detect the presence or absence of cancer associated antigen polypeptides and 
complexes of cancer associated antigen polypeptides and their binding partners and in 
purification protocols to isolated cancer associated antigen polypeptides and complexes of 
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cancer associated antigen polypeptides and their binding partners. Such agents also can be 
used to inhibit the native activity of the cancer associated antigen polypeptides, for example, 
by binding to such polypeptides. 

The invention, therefore, embraces peptide binding agents which, for example, can be 
antibodies or fragments of antibodies having the ability to selectively bind to cancer 
associated antigen polypeptides. Antibodies include polyclonal and monoclonal antibodies, 
prepared according to conventional methodology. 

Significantly, as is well-known in the art, only a small portion of an antibody 
molecule, the paratope, is involved in the binding of the antibody to its epitope (see, in 
general, Clark, W.R. (1986) The Experimental Foundations of Modern Immunology Wiley & 
Sons, Inc., New York; Roitt, I. (1991) Essential Immunology . 7th Ed., Blackwell Scientific 
Publications, Oxford). The pFc' and Fc regions, for example, are effectors of the complement 
cascade but are not involved in antigen binding. An antibody from which the pFc' region has 
been enzymatically cleaved, or which has been produced without the pFc' region, designated 
an F(ab') 2 fragment, retains both of the antigen binding sites of an intact antibody. Similarly, 
an antibody from which the Fc region has been enzymatically cleaved, or which has been 
produced without the Fc region, designated an Fab fragment, retains one of the antigen 
binding sites of an intact antibody molecule. Proceeding further, Fab fragments consist of a 
covalently bound antibody light chain and a portion of the antibody heavy chain denoted Fd. 
The Fd fragments are the major determinant of antibody specificity (a single Fd fragment may 
be associated with up to ten different light chains without altering antibody specificity) and Fd 
fragments retain epitope-binding ability in isolation. 

Within the antigen-binding portion of an antibody, as is well-known in the art, there 
are complementarity determining regions (CDRs), which directly interact with the epitope of 
the antigen, and framework regions (FRs), which maintain the tertiary structure of the 
paratope (see, in general, Clark, 1986; Roitt, 1991). In both the heavy chain Fd fragment and 
the light chain of IgG immunoglobulins, there are four framework regions (FR1 through FR4) 
separated respectively by three complementarity determining regions (CDR1 through CDR3). 
The CDRs, and in particular the CDR3 regions, and more particularly the heavy chain CDR3, 
are largely responsible for antibody specificity. 

It is now well-established in the art that the non-CDR regions of a mammalian 
antibody may be replaced with similar regions of conspecific or heterospecific antibodies 
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while retaining the epitopic specificity of the original antibody. This is most clearly 
manifested in the development and use of "humanized" antibodies in which non-human CDRs 
are covalently joined to human FR and/or Fc/pFc' regions to produce a functional antibody. 
See, e.g., U.S. patents 4,816,567, 5,225,539, 5,585,089, 5,693,762 and 5,859,205. 

Thus, for example, PCT International Publication Number WO 92/04381 teaches the 
production and use of humanized murine RSV antibodies in which at least a portion of the 
murine FR regions have been replaced by FR regions of human origin. Such antibodies, 
including fragments of intact antibodies with antigen-binding ability, are often referred to as 
"chimeric" antibodies. 

Thus, as will be apparent to one of ordinary skill in the art, the present invention also 
provides for F(ab') 2 , Fab, Fv and Fd fragments; chimeric antibodies in which the Fc and/or FR 
and/or CDR1 and/or CDR2 and/or light chain CDR3 regions have been replaced by 
homologous human or non-human sequences; chimeric F(ab') 2 fragment antibodies in which 
the FR and/or CDR1 and/or CDR2 and/or light chain CDR3 regions have been replaced by 
homologous human or non-human sequences; chimeric Fab fragment antibodies in which the 
FR and/or CDR1 and/or CDR2 and/or light chain CDR3 regions have been replaced by 
homologous human or non-human sequences; and chimeric Fd fragment antibodies in which 
the FR and/or CDR1 and/or CDR2 regions have been replaced by homologous human or non- 
human sequences. The present invention also includes so-called single chain antibodies. 

Thus, the invention involves polypeptides of numerous size and type that bind 
specifically to cancer associated antigen polypeptides, and complexes of both cancer 
associated antigen polypeptides and their binding partners. These polypeptides may be 
derived also from sources other than antibody technology. For example, such polypeptide 
binding agents can be provided by degenerate peptide libraries which can be readily prepared 
in solution, in immobilized form or as phage display libraries. Combinatorial libraries also 
can be synthesized of peptides containing one or more amino acids. Libraries further can be 
synthesized of peptoids and non-peptide synthetic moieties. 

Phage display can be particularly effective in identifying binding peptides useful 
according to the invention. Briefly, one prepares a phage library (using e.g. ml 3, fd, or 
lambda phage), displaying inserts from 4 to about 80 amino acid residues using conventional 
procedures. The inserts may represent, for example, a completely degenerate or biased array. 
One then can select phage-bearing inserts which bind to the cancer associated antigen 
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polypeptide. This process can be repeated through several cycles of reselection of phage that 
bind to the cancer associated antigen polypeptide. Repeated rounds lead to enrichment of 
phage bearing particular sequences. DNA sequence analysis can be conducted to identify the 
sequences of the expressed polypeptides. The minimal linear portion of the sequence that 
binds to the cancer associated antigen polypeptide can be determined. One can repeat the 
procedure using a biased library containing inserts containing part or all of the minimal linear 
portion plus one or more additional degenerate residues upstream or downstream thereof. 
Yeast two-hybrid screening methods also may be used to identify polypeptides that bind to the 
cancer associated antigen polypeptides. Thus, the cancer associated antigen polypeptides of 
the invention, or a fragment thereof, can be used to screen peptide libraries, including phage 
display libraries, to identify and select peptide binding partners of the cancer associated 
antigen polypeptides of the invention. Such molecules can be used, as described, for 
screening assays, for purification protocols, for interfering directly with the functioning of 
cancer associated antigen and for other purposes that will be apparent to those of ordinary 
skill in the art. 

As detailed herein, the foregoing antibodies and other binding molecules may be used 
for example to identify tissues expressing protein or to purify protein. Antibodies also may be 
coupled to specific diagnostic labeling agents for imaging of cells and tissues that express 
cancer associated antigens or to therapeutically useful agents according to standard coupling 
procedures. Diagnostic agents include, but are not limited to, barium sulfate, iocetamic acid, 
iopanoic acid, ipodate calcium, diatrizoate sodium, diatrizoate meglumine, metrizamide, 
tyropanoate sodium and radiodiagnostics including positron emitters such as fluorine- 18 and 
carbon- 1 1, gamma emitters such as iodine- 123, technitium-99m, iodine-131 and indium- 1 11, 
nuclides for nuclear magnetic resonance such as fluorine and gadolinium. Other diagnostic 
agents useful in the invention will be apparent to one of ordinary skill in the art. As used 
herein, "therapeutically useful agents" include any therapeutic molecule which desirably is 
targeted selectively to a cell expressing one of the cancer antigens disclosed herein, including 
antineoplastic agents, radioiodinated compounds, toxins, other cytostatic or cytolytic drugs, 
and so forth. Antineoplastic therapeutics are well known and include: aminoglutethimide, 
azathioprine, bleomycin sulfate, busulfan, carmustine, chlorambucil, cisplatin, 
cyclophosphamide, cyclosporine, cytarabidine, dacarbazine, dactinomycin, daunorubicin, 
doxorubicin, taxol, etoposide, fluorouracil, interferon-oc, lomustine, mercaptopurine, 
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methotrexate, mitotane, procarbazine HC1, thioguanine, vinblastine sulfate and vincristine 
sulfate. Additional antineoplastic agents include those disclosed in Chapter 52, 
Antineoplastic Agents (Paul Calabresi and Bruce A. Chabner), and the introduction thereto, 
1202-1263, of Goodman and Oilman's "The Pharmacological Basis of Therapeutics", Eighth 
Edition, 1990, McGraw-Hill, Inc. (Health Professions Division). Toxins can be proteins such 
as, for example, pokeweed anti-viral protein, cholera toxin, pertussis toxin, ricin, gelonin, 
abrin, diphtheria exotoxin, or Pseudomonas exotoxin. Toxin moieties can also be high 
energy-emitting radionuclides such as cobalt-60. 

In the foregoing methods, antibodies prepared according to the invention also 
preferably are specific for the renal cancer associated antigen/MHC complexes described 
herein. 

When "disorder" is used herein, it refers to any pathological condition where the 
cancer associated antigens are expressed. An example of such a disorder is cancer, breast, 
colon, gastric, renal, prostate and lung cancers as particular examples. 

Samples of tissue and/or cells for use in the various methods described herein can be 
obtained through standard methods such as tissue biopsy, including punch biopsy and cell 
scraping, and collection of blood or other bodily fluids by aspiration or other methods. 

In certain embodiments of the invention, an immunoreactive cell sample is removed 
from a subject. By "immunoreactive cell" is meant a cell which can mature into an immune 
cell (such as a B cell, a helper T cell, or a cytolytic T cell) upon appropriate stimulation. Thus 
immunoreactive cells include CD34 + hematopoietic stem cells, immature T cells and 
immature B cells. When it is desired to produce cytolytic T cells which recognize a cancer 
associated antigen, the immunoreactive cell is contacted with a cell which expresses a cancer 
associated antigen under conditions favoring production, differentiation and/or selection of 
cytolytic T cells; the differentiation of the T cell precursor into a cytolytic T cell upon 
exposure to antigen is similar to clonal selection of the immune system. 

Some therapeutic approaches based upon the disclosure are premised on a response by 
a subject's immune system, leading to lysis of antigen presenting cells, such as breast cancer 
cells which present one or more cancer associated antigens. One such approach is the 
administration of autologous CTLs specific to a cancer associated antigen/MHC complex to a 
subject with abnormal cells of the phenotype at issue. It is within the ability of one of 
ordinary skill in the art to develop such CTLs in vitro. An example of a method for T cell 
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differentiation is presented in International Application number PCT/US96/05607. Generally, 
a sample of cells taken from a subject, such as blood cells, are contacted with a cell presenting 
the complex and capable of provoking CTLs to proliferate. The target cell can be a 
transfectant, such as a COS cell. These transfectants present the desired complex of their 
surface and, when combined with a CTL of interest, stimulate its proliferation. COS cells are 
widely available, as are other suitable host cells. Specific production of CTL clones is well 
known in the art. The clonally expanded autologous CTLs then are administered to the 
subject. 

Another method for selecting antigen-specific CTL clones has recently been described 
(Altaian et al., Science 274:94-96, 1996; Dunbar et ah, Curr. Biol. 8:413-416, 1998), in which 
fluorogenic tetramers of MHC class I molecule/peptide complexes are used to detect specific 
CTL clones. Briefly, soluble MHC class I molecules are folded in vitro in the presence of p,- 
microglobulin and a peptide antigen which binds the class I molecule. After purification, the 
MHC/peptide complex is purified and labeled with biotin. Tetramers are formed by mixing 
the biotinylated peptide-MHC complex with labeled avidin (e.g. phycoerythrin) at a molar 
ratio or 4:1 . Tetramers are then contacted with a source of CTLs such as peripheral blood or 
lymph node. The tetramers bind CTLs which recognize the peptide antigen/MHC class I 
complex. Cells bound by the tetramers can be sorted by fluorescence activated cell sorting to 
isolate the reactive CTLs. The isolated CTLs then can be expanded in vitro for use as 
described herein. 

To detail a therapeutic methodology, referred to as adoptive transfer (Greenberg, J. 
Immunol. 136(5): 1917, 1986; Riddel et al., Science 257: 238, 1992; Lynch et al, Eur. J. 
Immunol. 21: 1403-1410,1991; Kast et al., Cell 59: 603-614, 1989), cells presenting the 
desired complex (e.g., dendritic cells) are combined with CTLs leading to proliferation of the 
CTLs specific thereto. The proliferated CTLs are then administered to a subject with a 
cellular abnormality which is characterized by certain of the abnormal cells presenting the 
particular complex. The CTLs then lyse the abnormal cells, thereby achieving the desired 
therapeutic goal. 

The foregoing therapy assumes that at least some of the subject's abnormal cells 
present the relevant HLA/cancer associated antigen complex. This can be determined very 
easily, as the art is very familiar with methods for identifying cells which present a particular 
HLA molecule, as well as how to identify cells expressing DNA of the pertinent sequences, in 



BNSDOCID: <WO 0020587A2_I_> 



WO 00/20587 PCT/US99/22873 

-38- 

this case a cancer associated antigen sequence. Once cells presenting the relevant complex are 
identified via the foregoing screening methodology, they can be combined with a sample from 
a patient, where the sample contains CTLs. If the complex presenting cells are lysed by the 
mixed CTL sample, then it can be assumed that a cancer associated antigen is being presented, 
and the subject is an appropriate candidate for the therapeutic approaches set forth supra. 

Adoptive transfer is not the only form of therapy that is available in accordance with 
the invention. CTLs can also be provoked in vivo, using a number of approaches. One 
approach is the use of non-proliferative cells expressing the complex. The cells used in this 
approach may be those that normally express the complex, such as irradiated tumor cells or 
cells transfected with one or both of the genes necessary for presentation of the complex (i.e. 
the antigenic peptide and the presenting HLA molecule). Chen et al. (Proc. Natl. Acad. Sci. 
USA 88: 1 10-1 14,1991) exemplifies this approach, showing the use of transfected cells 
expressing HPVE7 peptides in a therapeutic regime. Various cell types may be used. 
Similarly, vectors carrying one or both of the genes of interest may be used. Viral or bacterial 
vectors are especially preferred. For example, nucleic acids which encode a cancer associated 
antigen polypeptide or peptide may be operably linked to promoter and enhancer sequences 
which direct expression of the cancer associated antigen polypeptide or peptide in certain 
tissues or cell types. The nucleic acid may be incorporated into an expression vector. 
Expression vectors may be unmodified extrachromosomal nucleic acids, plasmids or viral 
genomes constructed or modified to enable insertion of exogenous nucleic acids, such as those 
encoding cancer associated antigen, as described elsewhere herein. Nucleic acids encoding a 
cancer associated antigen also may be inserted into a retroviral genome, thereby facilitating 
integration of the nucleic acid into the genome of the target tissue or cell type. In these 
systems, the gene of interest is carried by a microorganism, e.g., a Vaccinia virus, pox virus, 
herpes simplex virus, retrovirus or adenovirus, and the materials de facto "infect" host cells. 
The cells which result present the complex of interest, and are recognized by autologous 
CTLs, which then proliferate. 

A similar effect can be achieved by combining the cancer associated antigen or a 
stimulatory fragment thereof with an adjuvant to facilitate incorporation into antigen 
presenting cells in vivo. The cancer associated antigen polypeptide is processed to yield the 
peptide partner of the HLA molecule while a cancer associated antigen peptide may be 
presented without the need for further processing. Generally, subjects can receive an 
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intradermal injection of an effective amount of the cancer associated antigen. Initial doses can 
be followed by booster doses, following immunization protocols standard in the art. Preferred 
cancer associated antigens include those found to react with allogeneic cancer antisera, shown 
in the examples below. 

The invention involves the use of various materials disclosed herein to "immunize" 
subjects or as "vaccines". As used herein, "immunization" or "vaccination" means increasing 
or activating an immune response against an antigen. It does not require elimination or 
eradication of a condition but rather contemplates the clinically favorable enhancement of an 
immune response toward an antigen. Generally accepted animal models can be used for 
testing of immunization against cancer using a cancer associated antigen nucleic acid. For 
example, human cancer cells can be introduced into a mouse to create a tumor, and one or 
more cancer associated antigen nucleic acids can be delivered by the methods described 
herein. The effect on the cancer cells (e.g., reduction of tumor size) can be assessed as a 
measure of the effectiveness of the cancer associated antigen nucleic acid immunization. Of 
course, testing of the foregoing animal model using more conventional methods for 
immunization include the administration of one or more cancer associated antigen 
polypeptides or peptides derived therefrom, optionally combined with one or more adjuvants 
and/or cytokines to boost the immune response. Methods for immunization, including 
formulation of a vaccine composition and selection of doses, route of administration and the 
schedule of administration (e.g. primary and one or more booster doses), are well known in 
the art. The tests also can be performed in humans, where the end point is to test for the 
presence of enhanced levels of circulating CTLs against cells bearing the antigen, to test for 
levels of circulating antibodies against the antigen, to test for the presence of cells expressing 
the antigen and so forth. 

As part of the immunization compositions, one or more cancer associated antigens or 
stimulatory fragments thereof are administered with one or more adjuvants to induce an 
immune response or to increase an immune response. An adjuvant is a substance incorporated 
into or administered with antigen which potentiates the immune response. Adjuvants may 
enhance the immunological response by providing a reservoir of antigen (extracellularly or 
within macrophages), activating macrophages and stimulating specific sets of lymphocytes. 
Adjuvants of many kinds are well known in the art. Specific examples of adjuvants include 
monophosphoryl lipid A (MPL, SmithKline Beecham), a congener obtained after purification 
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and acid hydrolysis of Salmonella minnesota Re 595 ^polysaccharide; saponins including 
QS21 (SmithKline Beecham), a pure QA-21 saponin purified from Quillja saponaria extract; 
DQS21, described in PCT application W096/33739 (SmithKline Beecham); QS-7, QS-17, 
QS-18,andQS-Ll (So et al., Mol. Cells 7:178-186, 1 997); incomplete Freund's adjuvant; 
complete Freund's adjuvant; montanide; and various water-in-oil emulsions prepared from 
biodegradable oils such as squalene and/or tocopherol. Preferably, the peptides are 
administered mixed with a combination of DQS21/MPL. The ratio of DQS21 to MPL 
typically will be about l:10to 10:1 , preferably about 1:5 to 5:1 and more preferably about 1:1. 
Typically for human administration, DQS21 and MPL will be present in a vaccine 
formulation in the range of about 1 ug to about 100 ug. Other adjuvants are known in the art 
and can be used in the invention (see, e.g. Coding, Monoclonal Antibodies: Principles and 
Practice, 2nd Ed., 1986). Methods for the preparation of mixtures or emulsions of peptide 
and adjuvant are well known to those of skill in the art of vaccination. 

Other agents which stimulate the immune response of the subject can also be 
administered to the subject. For example, other cytokines are also useful in vaccination 
protocols as a result of their lymphocyte regulatory properties. Many other cytokines useful 
for such purposes will be known to one of ordinary skill in the art, including interleukin-12 
(IL-12) which has been shown to enhance the protective effects of vaccines (see, e.g., Science 
268: 1432-1434, 1995), GM-CSF and IL-18. Thus cytokines can be administered in 
conjunction with antigens and adjuvants to increase the immune response to the antigens. 

There are a number of immune response potentiating compounds that can be used in 
vaccination protocols. These include costimulatory molecules provided in either protein or 
nucleic acid form. Such costimulatory molecules include the B7-1 and B7-2 (CD80 and CD86 
respectively) molecules which are expressed on dendritic cells (DC) and interact with the 
CD28 molecule expressed on the T cell. This interaction provides costimulation (signal 2) to 
an antigen/MHC/TCR stimulated (signal 1) T cell, increasing T cell proliferation and effector 
function. B7 also interacts with CTLA4 (CD 152) on T cells and studies involving CTLA4 
and B7 ligands indicate that the B7-CTLA4 interaction can enhance antitumor immunity and 
CTL proliferation (Zheng P., et al. Proc. Natl. Acad. Sci. USA 95 (1 1):6284-6289 (1998)). 

B7 typically is not expressed on tumor cells so they are not efficient antigen presenting 
cells (APCs) for T cells. Induction of B7 expression would enable the tumor cells to stimulate 
more efficiently CTL proliferation and effector function. A combination of B7/IL-6/IL-12 



BNSDOCID: <WO 0020587A2_I_> 



¥ 



WO 00/20587 PCT/US99/22873 

-41 - 

costimulation has been shown to induce IFN-gamma and a Thl cytokine profile in the T cell 
population leading to further enhanced T cell activity (Gajewski et al., J. Immunol, 154:5637- 
5648 (1995)). Tumor cell transfection with B7 has ben discussed in relation to in vitro CTL 
expansion for adoptive transfer immunotherapy by Wang et al., (J. Immunol., 19:1-8 (1986)). 
Other delivery mechanisms for the B7 molecule would include nucleic acid (naked DNA) 
immunization (Kim J., et al. Nat Biotechnol, 15:7:641-646 (1997)) and recombinant viruses 
such as adeno and pox (Wendtner et al.. Gene Ther., 4:7:726-735 (1 997)). These systems are 
all amenable to the construction and use of expression cassettes for the coexpression of B7 
with other molecules of choice such as the antigens or fragment(s) of antigens discussed 
herein (including polytopes) or cytokines. These delivery systems can be used for induction 
of the appropriate molecules in vitro and for in vivo vaccination situations. The use of anti- 
CD28 antibodies to directly stimulate T cells in vitro and in vivo could also be considered. 
Similarly, the inducible co-stimulatory molecule ICOS which induces T cell responses to 
foreign antigen could be modulated, for example, by use of anti-ICOS antibodies (Hutloff et 
al., Nature 397:263-266, 1999). 

Lymphocyte function associated antigen-3 (LFA-3) is expressed on APCs and some 
tumor cells and interacts with CD2 expressed on T cells. This interaction induces T cell IL-2 
and IFN-gamma production and can thus complement but not substitute, the B7/CD28 
costimulatory interaction (Parra et al., J. Immunol, 158:637-642 (1997), Fenton et al., J. 
Immunother., 21:2:95-108 (1998)). 

Lymphocyte function associated antigen- 1 (LFA-1) is expressed on leukocytes and 
interacts with ICAM-1 expressed on APCs and some tumor cells. This interaction induces T 
cell IL-2 and IFN-gamma production and can thus complement but not substitute, the 
B7/CD28 costimulatory interaction (Fenton et al., J. Immunother., 21:2:95-108 (1998)). 
LFA-1 is thus a further example of a costimulatory molecule that could be provided in a 
vaccination protocol in the various ways discussed above for B7. 

Complete CTL activation and effector function requires Th cell help through the 
interaction between the Th cell CD40L (CD40 ligand) molecule and the CD40 molecule 
expressed by DCs (Ridge et al., Nature, 393:474 (1998), Bennett et al., Nature, 393:478 
(1998), Schoenberger et al., Nature, 393:480 (1998)). This mechanism of this costimulatory 
signal is likely to involve upregulation of B7 and associated IL-6/IL-12 production by the DC 
(APC). The CD40-CD40L interaction thus complements the signal 1 (antigen/MHC-TCR) 
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and signal 2 (B7-CD28) interactions. 

The use of anti-CD40 antibodies to stimulate DC cells directly, would be expected to 
enhance a response to tumor antigens which are normally encountered outside of a 
inflammatory context or are presented by non-professional APCs (tumor cells). In these 
situations Th help and B7 costimulation signals are not provided. This mechanism might be 
used in the context of antigen pulsed DC based therapies or in situations where Th epitopes 
have not been defined within known TRA precursors. 

A cancer associated antigen polypeptide, or a fragment thereof, also can be used to 
isolate their native binding partners. Isolation of such binding partners may be performed 
according to well-known methods. For example, isolated cancer associated antigen 
polypeptides can be attached to a substrate (e.g., chromatographic media, such as polystyrene 
beads, or a filter), and then a solution suspected of containing the binding partner may be 
applied to the substrate. If a binding partner which can interact with cancer associated antigen 
polypeptides is present in the solution, then it will bind to the substrate-bound cancer 
associated antigen polypeptide. The binding partner then may be isolated. 

It will also be recognized that the invention embraces the use of the cancer associated 
antigen cDNA sequences in expression vectors, as well as to transfect host cells and cell lines, 
be these prokaryotic (e.g., E. coli), or eukaryotic (e.g., dendritic cells, B cells, CHO cells, 
COS cells, yeast expression systems and recombinant baculovirus expression in insect cells). 
Especially useful are mammalian cells such as human, mouse, hamster, pig, goat, primate, etc. 
They may be of a wide variety of tissue types, and include primary cells and cell lines. 
Specific examples include keratinocytes, peripheral blood leukocytes, bone marrow stem cells 
and embryonic stem cells. The expression vectors require that the pertinent sequence, i.e., 
those nucleic acids described supra, be operably linked to a promoter. 

The invention also contemplates delivery of nucleic acids, polypeptides or peptides for 
vaccination. Delivery of polypeptides and peptides can be accomplished according to 
standard vaccination protocols which are well known in the art. In another embodiment, the 
delivery of nucleic acid is accomplished by ex vivo methods, i.e. by removing a cell from a 
subject, genetically engineering the cell to include a breast cancer associated antigen, and 
reintroducing the engineered cell into the subject. One example of such a procedure is 
outlined in U.S. Patent 5,399,346 and in exhibits submitted in the file history of that patent, all 
of which are publicly available documents. In general, it involves introduction in vitro of a 
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functional copy of a gene into a cell(s) of a subject, and returning the genetically engineered 
cell(s) to the subject. The functional copy of the gene is under operable control of regulatory 
elements which permit expression of the gene in the genetically engineered cell(s). Numerous 
transfection and transduction techniques as well as appropriate expression vectors are well 
known to those of ordinary skill in the art, some of which are described in PCT application 
WO95/00654. In vivo nucleic acid delivery using vectors such as viruses and targeted 
liposomes also is contemplated according to the invention. 

In preferred embodiments, a virus vector for delivering a nucleic acid encoding a 
cancer associated antigen is selected from the group consisting of adenoviruses, adeno- 
associated viruses, poxviruses including vaccinia viruses and attenuated poxviruses, Semliki 
Forest virus, Venezuelan equine encephalitis virus, retroviruses, Sindbis virus, and Ty virus- 
like particle. Examples of viruses and virus-like particles which have been used to deliver 
exogenous nucleic acids include: replication-defective adenoviruses (e.g., Xiang et al., 
Virology 219:220-227, 1996; Eloit et al., J. Virol 7:5375-5381, 1997; Chengalvala et al.. 
Vaccine 15:335-339, 1997), a modified retrovirus (Townsend et al., J. Virol. 71 :3365-3374, 
1997), a nonreplicating retrovirus (Irwin et aL, ./ Virol 68:5036-5044, 1994), a replication 
defective Semliki Forest virus (Zhao et al., Proa Natl Acad Sci. USA 92:3009-3013, 1995), 
canarypox virus and highly attenuated vaccinia virus derivative (Paoletti, Proc. Natl Acad. 
Sci. USA 93:1 1349-1 1353, 1996), non-replicative vaccinia virus (Moss, Proc. Natl Acad. Sci. 
USA 93:1 1341-11348, 1996), replicative vaccinia virus (Moss, Dev. Biol Stand. 82:55-63, 
1994), Venzuelan equine encephalitis virus (Davis et al., J. Virol 70:3781-3787, 1996), 
Sindbis virus (Pugachev et al., Virology 212:587-594, 1995), and Ty virus-like particle 
(Allsopp et al., Eur. J. Immunol 26:1951-1959, 1996). In preferred embodiments, the virus 
vector is an adenovirus. 

Another preferred virus for certain applications is the adeno-associated virus, a double- 
stranded DNA virus. The adeno-associated virus is capable of infecting a wide range of cell 
types and species and can be engineered to be replication-deficient. It further has advantages, 
such as heat and lipid solvent stability, high transduction frequencies in cells of diverse 
lineages, including hematopoietic cells, and lack of superinfection inhibition thus allowing 
multiple series of transductions. The adeno-associated virus can integrate into human cellular 
DNA in a site-specific manner, thereby minimizing the possibility of insertional mutagenesis 
and variability of inserted gene expression. In addition, wild-type adeno-associated virus 
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infections have been followed in tissue culture for greater than 100 passages in the absence of 
selective pressure, implying that the adeno-associated virus genomic integration is a relatively 
stable event. The adeno-associated virus can also function in an extrachromosomal fashion. 

In general, other preferred viral vectors are based on non-cytopathic eukaryotic viruses 
in which non-essential genes have been replaced with the gene of interest. Non-cytopathic 
viruses include retroviruses, the life cycle of which involves reverse transcription of genomic 
viral RNA into DNA with subsequent proviral integration into host cellular DNA. 
Adenoviruses and retroviruses have been approved for human gene therapy trials. In general, 
the retroviruses are replication-deficient (i.e., capable of directing synthesis of the desired 
proteins, but incapable of manufacturing an infectious particle). Such genetically altered 
retroviral expression vectors have general utility for the high-efficiency transduction of genes 
in vivo. Standard protocols for producing replication-deficient retroviruses (including the 
steps of incorporation of exogenous genetic material into a plasmid, transfection of a 
packaging cell lined with plasmid, production of recombinant retroviruses by the packaging 
cell line, collection of viral particles from tissue culture media, and infection of the target cells 
with viral particles) are provided in Kriegler, M., "Gene Transfer and Expression, A 
Laboratory Manual," W.H. Freeman Co., New York (1 990) and Murry, E.J. Ed. "Methods in 
Molecular Biology," vol. 7, Humana Press, Inc., Cliffton, New Jersey (1991). 

Preferably the foregoing nucleic acid delivery vectors: (1) contain exogenous genetic 
material that can be transcribed and translated in a mammalian cell and that can induce an 
immune response in a host, and (2) contain on a surface a ligand that selectively binds to a 
receptor on the surface of a target cell, such as a mammalian cell, and thereby gains entry to 
the target cell. 

Various techniques may be employed for introducing nucleic acids of the invention 
into cells, depending on whether the nucleic acids are introduced in vitro or in vivo in a host. 
Such techniques include transfection of nucleic acid-CaP0 4 precipitates, transfection of 
nucleic acids associated with DEAE, transfection or infection with the foregoing viruses 
including the nucleic acid of interest, liposome mediated transfection, and the like. For 
certain uses, it is preferred to target the nucleic acid to particular cells. In such instances, a 
vehicle used for delivering a nucleic acid of the invention into a cell (e.g., a retrovirus, or 
other virus; a liposome) can have a targeting molecule attached thereto. For example, a 
molecule such as an antibody specific for a surface membrane protein on the target cell or a 
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ligand for a receptor on the target cell can be bound to or incorporated within the nucleic acid 
delivery vehicle. Preferred antibodies include antibodies which selectively bind a cancer 
associated antigen, alone or as a complex with a MHC molecule. Especially preferred are 
monoclonal antibodies. Where liposomes are employed to deliver the nucleic acids of the 
invention, proteins which bind to a surface membrane protein associated with endocytosis 
may be incorporated into the liposome formulation for targeting and/or to facilitate uptake. 
Such proteins include capsid proteins or fragments thereof tropic for a particular cell type, 
antibodies for proteins which undergo internalization in cycling, proteins that target 
intracellular localization and enhance intracellular half life, and the like. Polymeric delivery 
systems also have been used successfully to deliver nucleic acids into cells, as is known by 
those skilled in the art. Such systems even permit oral delivery of nucleic acids. 

When administered, the therapeutic compositions of the present invention can be 
administered in pharmaceutically acceptable preparations. Such preparations may routinely 
contain pharmaceutically acceptable concentrations of salt, buffering agents, preservatives, 
compatible carriers, supplementary immune potentiating agents such as adjuvants and 
cytokines and optionally other therapeutic agents. 

The therapeutics of the invention can be administered by any conventional route, 
including injection or by gradual infusion over time. The administration may, for example, be 
oral, intravenous, intraperitoneal, intramuscular, intracavity, subcutaneous, or transdermal. 
When antibodies are used therapeutically, a preferred route of administration is by pulmonary 
aerosol. Techniques for preparing aerosol delivery systems containing antibodies are well 
known to those of skill in the art. Generally, such systems should utilize components which 
will not significantly impair the biological properties of the antibodies, such as the paratope 
binding capacity (see, for example, Sciarra and Cutie, "Aerosols," in Remington's 
Pharmaceutical Sciences. 18th edition, 1990, pp 1694-1712; incorporated by reference). 
Those of skill in the art can readily determine the various parameters and conditions for 
producing antibody aerosols without resort to undue experimentation. When using antisense 
preparations of the invention, slow intravenous administration is preferred. 

The compositions of the invention are administered in effective amounts. An 
"effective amount" is that amount of a cancer associated antigen composition that alone, or 
together with further doses, produces the desired response, e.g. increases an immune response 
to the cancer associated antigen. In the case of treating a particular disease or condition 
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characterized by expression of one or more cancer associated antigens, such as renal cancer, 
the desired response is inhibiting the progression of the disease. This may involve only 
slowing the progression of the disease temporarily, although more preferably, it involves 
halting the progression of the disease permanently. This can be monitored by routine methods 
or can be monitored according to diagnostic methods of the invention discussed herein. The 
desired response to treatment of the disease or condition also can be delaying the onset or 
even preventing the onset of the disease or condition. 

Such amounts will depend, of course, on the particular condition being treated, the 
severity of the condition, the individual patient parameters including age, physical condition, 
size and weight, the duration of the treatment, the nature of concurrent therapy (if any), the 
specific route of administration and like factors within the knowledge and expertise of the 
health practioner. These factors are well known to those of ordinary skill in the art and can be 
addressed with no more than routine experimentation. It is generally preferred that a 
maximum dose of the individual components or combinations thereof be used, that is, the 
highest safe dose according to sound medical judgment. It will be understood by those of 
ordinary skill in the art, however, that a patient may insist upon a lower dose or tolerable dose 
for medical reasons, psychological reasons or for virtually any other reasons. 

The pharmaceutical compositions used in the foregoing methods preferably are sterile 
and contain an effective amount of cancer associated antigen or nucleic acid encoding cancer 
associated antigen for producing the desired response in a unit of weight or volume suitable 
for administration to a patient. The response can, for example, be measured by determining 
the immune response following administration of the cancer associated antigen composition 
via a reporter system by measuring downstream effects such as gene expression, or by 
measuring the physiological effects of the cancer associated antigen composition, such as 
regression of a rumor or decrease of disease symptoms. Other assays will be known to one of 
ordinary skill in the art and can be employed for measuring the level of the response. 

The doses of cancer associated antigen compositions (e.g., polypeptide, peptide, 
antibody, cell or nucleic acid) administered to a subject can be chosen in accordance with 
different parameters, in particular in accordance with the mode of administration used and the 
state of the subject. Other factors include the desired period of treatment. In the event that a 
response in a subject is insufficient at the initial doses applied, higher doses (or effectively 
higher doses by a different, more localized delivery route) may be employed to the extent that 
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patient tolerance permits. 

In general, for treatments for eliciting or increasing an immune response, doses of 
cancer associated antigen are formulated and administered in doses between 1 ng and 1 mg, 
and preferably between 10 ng and 100 Mg, according to any standard procedure in the art. 
Where nucleic acids encoding cancer associated antigen of variants thereof are employed, 
doses of between 1 ng and 0.1 mg generally will be formulated and administered according to 
standard procedures. Other protocols for the administration of cancer associated antigen 
compositions will be known to one of ordinary skill in the art, in which the dose amount, 
schedule of injections, sites of injections, mode of administration (e.g., intra-tumoral) and the 
like vary from the foregoing. Administration of cancer associated antigen compositions to 
mammals other than humans, e.g. for testing purposes or veterinary therapeutic purposes, is 
carried out under substantially the same conditions as described above. 

Where cancer associated antigen peptides are used for vaccination, modes of 
administration which effectively deliver the cancer associated antigen and adjuvant, such that 
an immune response to the antigen is increased, can be used. For administration of a cancer 
associated antigen peptide in adjuvant, preferred methods include intradermal, intravenous, 
intramuscular and subcutaneous administration. Although these are preferred embodiments, 
the invention is not limited by the particular modes of administration disclosed herein. 
Standard references in the art (e.g., Remington 's Pharmaceutical Sciences, 1 8th edition, 1 990) 
provide modes of administration and formulations for delivery of immunogens with adjuvant 
or in a non-adjuvant carrier. 

When administered, the pharmaceutical preparations of the invention are applied in 
pharmaceutically-acceptable amounts and in pharmaceutically-acceptable compositions. The 
term "pharmaceutically acceptable" means a non-toxic material that does not interfere with the 
effectiveness of the biological activity of the active ingredients. Such preparations may 
routinely contain salts, buffering agents, preservatives, compatible carriers, and optionally 
other therapeutic agents. When used in medicine, the salts should be pharmaceutically 
acceptable, but non-pharmaceutically acceptable salts may conveniently be used to prepare 
pharmaceutically-acceptable salts thereof and are not excluded from the scope of the 
invention. Such pharmacologically and pharmaceutically-acceptable salts include, but are not 
limited to, those prepared from the following acids: hydrochloric, hydrobromic, sulfuric, 
nitric, phosphoric, maleic, acetic, salicylic, citric, formic, malonic, succinic, and the like. 
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Also, pharmaceutically-acceptable salts can be prepared as alkaline metal or alkaline earth 
salts, such as sodium, potassium or calcium salts. 

A renal cancer associated antigen composition may be combined, if desired, with a 
pharmaceutically-acceptable carrier. The term "pharmaceutically-acceptable carrier" as used 
herein means one or more compatible solid or liquid fillers, diluents or encapsulating 
substances which are suitable for administration into a human. The term "carrier" denotes an 
organic or inorganic ingredient, natural or synthetic, with which the active ingredient is 
combined to facilitate the application. The components of the pharmaceutical compositions 
also are capable of being co-mingled with the molecules of the present invention, and with 
each other, in a manner such that there is no interaction which would substantially impair the 
desired pharmaceutical efficacy. 

The pharmaceutical compositions may contain suitable buffering agents, including: 
acetic acid in a salt; citric acid in a salt; boric acid in a salt; and phosphoric acid in a salt. 

The pharmaceutical compositions also may contain, optionally, suitable preservatives, 
such as: benzalkonium chloride; chlorobutanol; parabens and thimerosal. 

The pharmaceutical compositions may conveniently be presented in unit dosage form 
and may be prepared by any of the methods well-known in the art of pharmacy. All methods 
include the step of bringing the active agent into association with a carrier which constitutes 
one or more accessory ingredients. In general, the compositions are prepared by uniformly 
and intimately bringing the active compound into association with a liquid carrier, a finely 
divided solid carrier, or both, and then, if necessary, shaping the product. 

Compositions suitable for oral administration may be presented as discrete units, such 
as capsules, tablets, lozenges, each containing a predetermined amount of the active 
compound. Other compositions include suspensions in aqueous liquids or non-aqueous 
liquids such as a syrup, elixir or an emulsion. 

Compositions suitable for parenteral administration conveniently comprise a sterile 
aqueous or non-aqueous preparation of breast cancer associated antigen polypeptides or 
nucleic acids, which is preferably isotonic with the blood of the recipient. This preparation 
may be formulated according to known methods using suitable dispersing or wetting agents 
and suspending agents. The sterile injectable preparation also may be a sterile injectable 
solution or suspension in a non-toxic parenterally-acceptable diluent or solvent, for example, 
as a solution in 1,3-butane diol. Among the acceptable vehicles and solvents that may be 
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employed are water, Ringer's solution, and isotonic sodium chloride solution. In addition, 
sterile, fixed oils are conventionally employed as a solvent or suspending medium. For this 
purpose any bland fixed oil may be employed including synthetic mono-or di-glycerides. In 
addition, fatty acids such as oleic acid may be used in the preparation of injectables. Carrier 
formulation suitable for oral, subcutaneous, intravenous, intramuscular, etc. administrations 
can be found in Remington's Pharmaceutical Sciences, Mack Publishing Co., Easton, PA. 

As used herein with respect to nucleic acids, the term "isolated" means: (i) amplified 
in vitro by, for example, polymerase chain reaction (PCR); (ii) recombinantly produced by 
cloning; (iii) purified, as by cleavage and gel separation; or (iv) synthesized by, for example, 
chemical synthesis. An isolated nucleic acid is one which is readily manipulate by 
recombinant DNA techniques well known in the art. Thus, a nucleotide sequence contained in 
a vector in which 5' and 3' restriction sites are known or for which polymerase chain reaction 
(PCR) primer sequences have been disclosed is considered isolated but a nucleic acid 
sequence existing in its native state in its natural host is not. An isolated nucleic acid may be 
substantially purified, but need not be. For example, a nucleic acid that is isolated within a 
cloning or expression vector is not pure in that it may comprise only a tiny percentage of the 
material in the cell in which it resides. Such a nucleic acid is isolated, however, as the term is 
used herein because it is readily manipulate by standard techniques known to those of 
ordinary skill in the art. An isolated nucleic acid as used herein is not a naturally occurring 
chromosome. 

As used herein with respect to polypeptides, "isolated" means separated from its native 
environment and present in sufficient quantity to permit its identification or use. Isolated, 
when referring to a protein or polypeptide, means, for example: (i) selectively produced by 
expression cloning or (ii) purified as by chromatography or electrophoresis. Isolated proteins 
or polypeptides may, but need not be, substantially pure. The term "substantially pure" means 
that the proteins or polypeptides are essentially free of other substances with which they may 
be found in nature or in vivo systems to an extent practical and appropriate for their intended 
use. Substantially pure polypeptides may be produced by techniques well known in the art. 
Because an isolated protein may be admixed with a pharmaceutical ly acceptable carrier in a 
pharmaceutical preparation, the protein may comprise only a small percentage by weight of 
the preparation. The protein is nonetheless isolated in that it has been separated from the 
substances with which it may be associated in living systems, i.e. isolated from other proteins. 
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Examples 

Example 1: SEREX screening of renal cancer cell line 1973/10.4 

A standard cDNA library was prepared using 5 ug of poly A + RNA derived from the 
cell line 1973/10.4. A primary (unamplified) cDNA library was immunoscreened (5 x 10 5 
clones per library) by standard SEREX methodology, with absorbed autologous patient serum 
at 1:200 dilution [Sahin, U. et al., Proc Natl Acad Sci USA 92:1 1810-3 (1995); Chen, Y.T. et 
al. Proc Natl Acad Sci USA. 94:1914-8 (1997)].. Excluding false-positive clones encoding 
immunoglobulin gene fragments, clones were purified and sequence analyzed. Comparisons 
of the sequences showed that these clones represented cDNAs from 22 distinct genes, 
designated NY-REN-45 through NY-LU-66 (Table A and Sequence Listing (SEQ ID NOs: 1 - 
21)). A homology search through the GenBank/EMBO databases revealed that 14 of the 22 
genes corresponded to previously known molecules, and 8 others were unknown genes, with 
sequence identity limited only to short segments of known genes or to expressed sequence 
tags (ESTs). 

Analysis of isolated clones: 

I. NY-REN clones which are known gene products 



Designation 


Gene/Sequence Identity 


Accession 
Number 


SEQ ID NO 


NY-REN-46 


lactate dehydrogenase B 


Y00711 


22 


NY-REN-47 


ERK tyrosine kinase 


D31661 


23 


NY-REN-48 


PINCH protein 


U09284 


24 


NY-REN-5 1 


BBP/53BP2 


U58334 


25 


NY-REN-52 


steroid receptor coactivator 


U59302 


26 


NY-REN-5 3 


KIAA0336 mRNA tag 


AB002334 


27 


NY-REN-54 


E6 oncogenic protein- 
associated protein 


X98033 


28 


NY-REN-5 5 


murine NEK1 protein kinase 
homologue 


S45828 


29 
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NY-REN-56 


6-phospho-fructokinase 


D49817 


30 


NY-REN-59 


lactate dehydrogenase A 


X02152 


31 


NY-REN-61 


KIAA0081 mRNA tag 


D42039 


32 


NY-REN-63 


DDB pl27-associated protein 


AF035950 


33 


NY-REN-65 


HREV107 protein 


X92814 


34 


NY-REN-66 


acidic ribosomal 
phosphoprotein 2 


MI 7887 


35 



II. Novel gene products 

Clone SEQ ID NO: Size Tissue mRNA expression Protein SEP ID NO: 



NY-REN-45 


1 


4.0kb 


Ubiquitous 


12 


NY-REN-49 


2 


l.lkb 


Ubiquitous 


13 


NY-REN-50 


3 


1.8kb 


Ubiquitous 


14 


NY-REN-57 


4,5 


2.9kb 


Ubiquitous 


15, 16 


NY-REN-58 


6 


1.9kb 


Ubiquitous 


17 


NY-REN-60 


7 


4.0kb 


Ubiquitous 


18 


NY-REN-62 


8,9 


2.7kb 


Ubiquitous 


19, 20 


NY-REN-64 


10,11 


3.0kb 


Ubiquitous 


21 



III. Clones which react with autologous sera only: 

NY-REN-47 

NY-REN-49 

NY-REN-50 



IV. Clones which react with sera from normal control donors 

Frequency of sera reactivity 
Clone normal cancer patient 

NY-REN-46 4/4 6/14 

NY-REN-48 14/14 17/17 

NY-REN-5 1 1/12 3/17 
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NY-REN-52 


4/12 


7/17 


NY-REN-53 


4/19 


10/31 


NY-REN-54 


5/8 


7/7 


NY-REN-55 


3/19 


6/31 


NY-REN-56 


3/19 


7/31 


NY-REN-57 


1/19 


3/31 


NY-REN-58 


1/12 


2/17 


NY-REN-59 


1/19 


4/31 


NY-REN-61 


3/19 


5/31 


NY-REN-62 


1/19 


4/31 


NY-REN-63 


2/19 


12/31 


NY-REN-64 


3/19 


6/31 


NY-REN-65 


2/19 


2/31 



V. Clones which react with sera from cancer patients only (failed to react- with- 19 normal 
patient serum samples). These clones are preferred for therapeutic and diagnostic 
applications. 



Frequency of reactivity 



NY-REN-32 3/31 

NY-REN-45 3/31 

NY-REN-57 2/31 

NY-REN-60 5/31 

NY-REN-66 2/31 



VI. Additional allogeneic screening of NY-REN renal SEREX clones 

Renal SEREX clones were tested for reactivity with sera from the normal and various 
cancer patients listed below. 

. Sera 



>_0020587A2_I_> 



t 



WO 00/20587 



PCT/US99/22873 



53 



Clone 


normal 


colon 


renal 


lung 


breast 


NY-REN-3 


0/26 


7/37 


8/32 


0/23 


1/26 


NY-REN- 12 


0/19 


0/16 


3/32 


0/15 


0/16 


NY-REN- 19 


0/19 


0/16 


2/32 


0/15 


0/16 


NY-REN-21 


0/16 


3/16 


3/32 


1/15 


0/16 


NY-REN-25 


0/15 


0/16 


5/32 


0/15 


0/16 


NY-REN-3 1 


0/14 


0/16 


5/32 


0/15 


0/16 


NY-REN-3 2 


0/14 


2/16 


3/32 


0/15 


0/16 


NY-REN-3 7 


0/15 


0/16 


2/32 


0/15 


0/16 


NY-REN-45 


0/14 


0/16 


3/32 


1/15 


0/16 


NY-REN-57 


0/19 


0/16 


2/32 


0/15 


0/16 


NY-REN-60 


0/19 


0/16 


7/32 


0/15 


0/16 


NY-REN-66 


0/19 


0/16 


2/32 


0/15 


0/16 



Example 2: SERE X screening of tumor cells and testis libraries 

Standard cDNA libraries were prepared using poly A + RN A derived from the various 
cancer sources, as well as testis. The cDNA libraries were immunoscreened by standard 
SEREX methodology, with absorbed autologous patient serum [Sahin, U. et al., Proc Natl 
AcadSci USA 92:1 1810-3 (1995); Chen, Y.T. etal. Proc Natl Acad Sci USA. 94:1914-8 
(1997)]. Excluding false-positive clones encoding immunoglobulin gene fragments, clones 
were purified and sequence analyzed. Comparisons of the sequences showed that these clones 
all were substantially identical to the kinectin cDNA (GenBank Accession number L25616). 



Table A: Kinectin clones isolated hv SF/RFX 




Clone designation 


SEO ID NO 


Library 


Patient sera for screen 


NGO-St-47 


36 


Gastric tumor 


autologous 


TS-64-5' 


37 


Testis 


seminoma 


HOM-TSOv3-41 (5') 


38 


Testis 


seminoma 


HOM-TSOv3-41 (3') 


39 


Testis 


seminoma 


HOM-HD2-2(3') 


40 


Hodgkin's disease 


autologous 


HOM-HD2-232(3 f ) 


41 


Hodgkin's disease 


autologous 


Thy4 (3*) 


42 


Thyroid tumor 


autologous 
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Thy4 (3*) 
Thy5 (3') 
Thy8 (3') 
Thy 10 (3') 
NGO-Br-1 



42 
43 
44 
45 
46 



Thyroid tumor 
Thyroid tumor 
Thyroid tumor 
Thyroid tumor 
Breast cancer 



autologous 
autologous 
autologous 
autologous 
autologous 



In addition, a hepatocarcinoma library screened with autologous sera also identified several 
clones which were sequenced and found to be substantially identical to kinectin. 

Allotyping (gastric cancer) 

12/12 gastric cancer patient sera recognized kinectin 
0/27 normal individual sera recognized kinectin 

Therefore it was determined that recognition of kinectin was diagnostic for cancer patients. 

Example 3: Preparation of recombinant cancer associated antigens 

To facilitate screening of patients' sera for antibodies reactive with cancer associated 
antigens, for example by ELISA, recombinant proteins are prepared according to standard 
procedures. In one method, the clones encoding cancer associated antigens are subcloned into 
a baculovirus expression vector, and the recombinant expression vectors are introduced into 
appropriate insect cells. Baculovirus/insect cloning systems are preferred because post- 
translational modifications are carried out in the insect cells. Another preferred eukaryotic 
system is the Drosophila Expression System from Invitrogen. Clones which express high 
amounts of the recombinant protein are selected and used to produce the recombinant 
proteins. The recombinant proteins are tested for antibody recognition using serum from the 
patient which was used to isolated the particular clone, or in the case of cancer associated 
antigens recognized by allogeneic sera, by the sera from any of the patients used to isolate the 
clones or sera which recognize the clones' gene products. 

Alternatively, the cancer associated antigen clones are inserted into a prokaryotic 
expression vector for production of recombinant proteins in bacteria. Other systems, 
including yeast expression systems and mammalian cell culture systems also can be used. 



Example 4: Preparation of antibodies to cancer associated antigens 
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The recombinant cancer associated antigens produced as in Example 4 above are used 
to generate polyclonal antisera and monoclonal antibodies according to standard procedures. 
The antisera and antibodies so produced are tested for correct recognition of the cancer 
associated antigens by using the antisera/antibodies in assays of cell extracts of patients 
known to express the particular cancer associated antigen (e.g. an ELISA assay). These 
antibodies can be used for experimental purposes (e.g. localization of the cancer associated 
antigens, immunoprecipitations, Western blots, etc.) as well as diagnostic purposes (e.g., 
testing extracts of tissue biopsies, testing for the presence of cancer associated antigens). 

Example 5: Expression of cancer associated antigens in cancers of similar and different 
rigin. 

The expression of one or more of the cancer associated antigens is tested in a range of 
tumor samples to determine which, if any, other malignancies should be diagnosed and/or 
treated by the methods described herein. Tumor cell lines and tumor samples are tested for 
cancer associated antigen expression, preferably by RT-PCR according to standard 
procedures. Northern blots also are used to test the expression of the cancer associated 
antigens. Antibody based assays, such as ELISA and western blot, also can be used to 
determine protein expression. A preferred method of testing expression of cancer associated 
antigens (in other cancers and in additional same type cancer patients) is allogeneic serotyping 
using a modified SEREX protocol (as described above). 

In all of the foregoing, extracts from the tumors of patients who provided sera for the 
initial isolation of the cancer associated antigens are used as positive controls. The cells 
containing recombinant expression vectors described in the Examples above also can be used 
as positive controls. 

The results generated from the foregoing experiments provide panels of multiple 
cancer associated nucleic acids and/or polypeptides for use in diagnostic (e.g. determining the 
existence of cancer, determining the prognosis of a patient undergoing therapy, etc.) and 
therapeutic methods (e.g., vaccine composition, etc.). 

Example 6: HLA typing of patients positive for cancer associated antigens 

To determine which HLA molecules present peptides derived from the cancer 
associated antigens of the invention, cells of the patients which express the cancer associated 
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antigens are HLA typed. Peripheral blood lymphocytes are taken from the patient and typed 
for HLA class I or class II, as well as for the particular subtype of class I or class II. Tumor 
biopsy samples also can be used for typing. HLA typing can be carried out by any of the 
standard methods in the art of clinical immunology, such as by recognition by specific 
monoclonal antibodies, or by HLA allele-specific PCR (e.g. as described in W097/31 126). 

Example 7: Characterization of cancer associated antigen peptides presented by MHC 
class I and class II molecules. 

Antigens which provoke an antibody response in a subject may also provoke a cell- 
mediated immune response. Cells process proteins into peptides for presentation on MHC 
class I or class II molecules on the cell surface for immune surveillance. Peptides presented 
by certain MHC/HLA molecules generally conform to motifs. These motifs are known in 
some cases, and can be used to screen the renal cancer associated antigens for the presence of 
potential class I and/or class II peptides. Summaries of class I and class II motifs have been 
published (e.g., Rammensee et al., Immunogenetics 41:178-228, 1995). Based on the results 
of experiments such as those described above, the HLA types which present the individual 
breast cancer associated antigens are known. Motifs of peptides presented by these HLA 
molecules thus are preferentially searched. 

One also can search for class I and class II motifs using computer algorithms. For 
example, computer programs for predicting potential CTL epitopes based on known class I 
motifs has been described {see, e.g., Parker et al, J. Immunol. 152: 163, 1994; D'Amaro et al., 
Human Immunol. 43:13-18, 1995; Drijfhout et al., Human Immunol. 43:1-12, 1995). 
Computer programs for predicting potential T cell epitopes based on known class II motifs 
has also been described (see, e.g Sturniolo et al., Nat Biotechnol 17(6):555-61, 1999). HLA 
binding predictions can conveniently be made using an algorithm available via the Internet on 
the National Institutes of Health World Wide Web site at URL http://bimas.dcrt.nih.gov . See 
also the website of: SYFPEITHI: An Internet Database for MHC Ligands and Peptide Motifs 
(access via http://www.uni-tuebingen.de/uni/kxi / or 

http://l 34. 2.96.22 1 /scripts/hlaserver.dll/EpPredict.htm . Methods for determining HLA class 
II peptides and making substitutions thereto are also known (e.g. Strominger and 
Wucherpfennig (PCT/US96/03 1 82)). 
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Example 8: Identification of the portion of a cancer associated polypeptide encoding an 
antigen 

To determine if the cancer associated antigens isolated as described above can provoke 
a cytolytic T lymphocyte response, the following method is performed. CTL clones are 
generated by stimulating the peripheral blood lymphocytes (PBLs) of a patient with 
autologous normal cells transfected with one of the clones encoding a cancer associated 
antigen polypeptide or with irradiated PBLs loaded with synthetic peptides corresponding to 
the putative protein and matching the consensus for the appropriate HLA class I molecule (as 
described above) to localize an antigenic peptide within the cancer associated antigen clone 
(see, e.g., Knuth et al., Proc. Natl. Acad. Sci. USA 81:351 1-3515, 1984; van der Bruggen et 
aL, Eur. J. Immunol. 24: 3 03 8-3 043, 1994). These CTL clones are screened for specificity 
against COS cells transfected with the cancer associated antigen clone and autologous HLA 
alleles as described by Brichard et al. (Eur. J. Immunol. 26:224-230, 1996). CTL recognition 
of a cancer associated antigen is determined by measuring release of TNF from the cytolytic T 
lymphocyte or by 5l Cr release assay (Herin et al., Int. J. Cancer 39:390-396, 1987). If a CTL 
clone specifically recognizes a transfected COS cell, then shorter fragments of the cancer 
associated antigen clone transfected in that COS cell are tested to identify the region of the 
gene that encodes the peptide. Fragments of the cancer associated antigen clone are prepared 
by exonuclease III digestion or other standard molecular biology methods. Synthetic peptides 
are prepared to confirm the exact sequence of the antigen. 

Optionally, shorter fragments of cancer associated antigen cDNAs are generated by 
PCR. Shorter fragments are used to provoke TNF release or 5l Cr release as above. 

Synthetic peptides corresponding to portions of the shortest fragment of the cancer 
associated antigen clone which provokes TNF release are prepared. Progressively shorter 
peptides are synthesized to determine the optimal cancer associated antigen tumor rejection 
antigen peptides for a given HLA molecule. 

A similar method is performed to determine if the cancer associated antigen contains 
one or more HLA class II peptides recognized by T cells. One can search the sequence of the 
cancer associated antigen polypeptides for HLA class II motifs as described above. In 
contrast to class I peptides, class II peptides are presented by a limited number of cell types. 
Thus for these experiments, dendritic cells or B cell clones which express HLA class II 
molecules preferably are used. 
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Table 1: Sequence homolog ies 

SEQ ID NO. : 1 

AB002794, U4 6118RNU4 6118U194 82AF02 621 6AB0027 3 9 , AB002730 AB002728 
AF058796, AB002777, AF020187, AF009411, AB015609, AF006627 M95076 U43527 
AB002741, U25846, AF006628, AF019043, D83352, 01^355, M99575 U48288 
AF090440, AB007504, U82480, Y15794, AJ005572, AF029349, L10111 S80963 
U38894, L41731, AF022733, AJ225108, Y11879, AF001688, ^33214 Z97178 
AF009413, AF019907, AF016371, X71980, AF001522, M77169, AF023132 D83476 
AJ009675, U90554, AF069324, AF048691, Z34799, AF004947 U60149 ^22732' 
AF019887, L02937, U55848, AF011331, AF045770, AF043533 X69524 M32882 

Y14339 ' AJ005969 ' U90555; 5 U4443of 8 u?4296, 
AF001501, AJ223316, D78609, U41060, Z54362, AF034387, U62398, AB005545 
AB002533, Y13865, AF009959, L02938, U10555, AL010246, AC003041, U37699 
S75970, AF071010, Z81311, AB003681, X16353, AB002731 X53081, D63450 
D10911, D90115, X97065, Z82275, AB016891, Y09455, X77990, , W58357 W07820 
AA188593, W81046, AA858164, AI018124, AI139112, AA769634 W00437, D80849 
AA448160, N98650, T31293, H15307, H51146, N29314, AA770301, AA187822 
AA978299, T31823, AA936410, AA993194, N41386, W45601, W81099, C0569l' 
D60153, AA780119, AA929004, R68608, Z43271, N37024, AA974718, AA928663 
T30120, AA936583, H51109, N20251, W92917, T36035, AA480197, Z24908 N79450 
AA872019, AA902275, N59810, R13443, AA740162, AA937759, N72168, AA970708 ' 
T36257, AA433833, H15700, D51185, AA249138, R37356, W93028, Z28641 
AA371494, N72757, AA719126, N56164, T31790, D57384, Z41959, N58984' 
AA847848, AA587009, Z39343, AA654834, AA505490, Z38243, D20344 AA252395 
AA025593, H70133, R09101, AA159862, AI032981, Z28355, C75020, AI139642 ' 
D82421, AI126922, C75170, AI016032, C18748, C75478, AI129334 C75472 ' 
AA310765, D63057, C05952, AA357303, C16591, D82132, L48852, D82799 C75176 
AI124552, AA669404, U30155, C75118, AA374918, C75108, C05853, U30151 
D57346, AI127548, AA918527, AA317816, AA573490, R21699, AA917928 R36311 
AA361522, AA701252, AI085492, H44387, AA156256, AA587935, AA976510 

W73374, T27986, N34493, AA737770, N32609, N32612, H64420, , 
AA413717, AA117350, AA242502, AA117343, AA545256, AA795651, 
U31322, AA681967, AA221922, AA600546, AA050610, AU018628, C80932, 
AA863834, AI099036, AA183239, AI115182, AA590910, W65628, 
, AA109440, AI052952, AA999324, AI105714, AI026280, AI072678, 
AA754198, AA944557, AI045710, C94989, AA471630, AA933231, 
AA257557, AA509328, AA109365, AA963207, AA4 354 73 , ■ AA99930 6 
AI105662, AA509174, AI108597, AA109374, AA925182, AA471671, 
AA752812, AA626989, AA406924, AA840999, AA509033, C94899 
AA842672, AA123619, AA072471, AA509339, AA933108, AI110259, 
N99262, AA661430, AA842512, AA180648, AI066048, AA559515, 
AA109423, AF074736, AA257676, D87312, AA257488, AA114390, 
W06665, W32852, AA123649, AA842505, AA509330, AA908025, AA257279, 
AA406673, AA627018, AA406850, AA161668, AA114453, AA109372, 
AA803997, AA406943, AA257495, AA842019, AA661368, AI082949, 
W06539, AA118223, AA406986, AA471717, AI096212, AA257707, 
AI053126, AA508950, AA713366, W69049, AA965381, AA208680, 

AI058071, AA675858, AA257513, AI083337, AA406908, 
AA842532, AA754036, AA753165, W18199, AI109620, 
AA471431, AA032116, AA754544, AI108146, C42045, 
AA509237, AA161620, AA257665, AA471605, W84932, 
AA471695, AA003471, AA840977, AA509267, AA841711, 
N98079, AA454318, W04102, AA842874, N99754, AA509307, 
AA508954, AA406746, AA842720, W23363, AA508951, 
AI108220, AA508946, AA471497, AA842911, AA842501, 
AA547812, AA752986, AA752003, AA406839, AA508993, 
AA180623, AA508986, AA842627, AA753129, AA509214, 
N74818, AA753300, AA430818, AA417415, AA114426, R46936, 
AA753138, AI058077, AA842464, AA966639, 7^842265, 
W15128, AA660039, AA842275, AA454424, AA180651, 



AA515269 
AA415243 
AA106372 
AA920654 
AA162291 
AA964820 
AA509077 
AA406684 
AA088161 
AA406875 
AA842135 
AA406761 
AI087701 
AI105717 
AA257572 
AA842888 
AA25764 9 
AI057997 
AI083256 
AI058022 
AA843025 
AA841358 
AA471488 
AA751845 
AI068913 
AA123614 
AA161747 
AA406733 
AA979824 
AA509025 



AA161707, 
AA659956, 
AI064026, 
AI109103, 
AA843040, 
AA257699, 
AA661358, 
AA417420, 
AA509218, 
AA509008, 
AI105737, 
AA246112, 
AA601853, 
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iKSSS; £2S?2Ii: SES&'S: SX!JJ'V» SSSSS^SSI' 
2SB?!S: iSSiiS: Silii^^j L s H iC™ sy,: 

AA675813, AI083003, AI021727, AI083274 AI083309; ^2823 AaSs3962 ' 
AA114520, AA430792, C47230, AA842674, AI053141, D75937 i^84303? W6B<»7Q 
AI105681, W91819, AA756971, AA509104 AA111828 AI109720 lillliei ' 
AA933311, AA180602, AA842093, R86419, AI058057 AA186285 SSJSesli' 
AI105725, AA406888, AA649397, N94700, W51718, AA54 791 6 AA84 064 6 [J7 4 1 1 fi 
AA257756, AI082996, W91818, N74830, AA123585 AA118224 ^45444^' ZiAth 
AA509109, AA842387, AI052833, AA751998, AA123634 AA257 3 J? AA803 9 5J ' 
AA842493, AA471448, AA406716, AA841361, AA840661 AA713447' MAoIiIa' 
AA257695, AA841403, AA751834, AA406947, AI096182 AI105734' St 3 W59918 
AA089352, AA180566, AA257427, AA257522, AA406980 AA471469 AA509264 
AA509032, AA749469, AA752028, AA754167, AA754646 AA8 4 2335 ^JIs52' 
AI065957, AI065970, AI096289, AA109379, AA109472 AA114484 AAI6I689' 
AA430830, AA406948, AA433246, AA454371, AA471492 AA471703 ^752086* 
AA842624, AI096185, R47079, R86415, AA161711, Aa1 6 1655, AA180549 a1 2 57424 
AA257437, AA257749, AA433393, AA471419, AA471602, AM30922 tltl'lA^ ' 
AA752034, AA752066, AA842384, AI082936, AI083329 AI105677 Z339l5 
AA109362, AA109417, AA051807, AA430797, AA406799, AA430806' AA495548 
AA661116, AI082951, AI105685, AI114069, R47062, W59884, AA1092 92 AA7 5182 9 
AA754172, AA842257, AI082934, AI082958, AA180706, AA4 71516, AA508962 
AA751563, AA751816, AA753137, AA753167, AA840972 AA842079 AA874756' 
nJo?of?i' AI108824 ' AA180588, AA406676, AA471392, AA509142 AA840909 
■^li^ll' AI066854 ' R47172, AA161565, AA257643, AA753161, AA471447 
AA680450, AA752897, AA753237, AA842230, AA842544, AA161635, AA454428 
AA508933, AA509309, C41215, AA842585, D73786, D75990 D75808 AA62704 9 
N74809, AA257682, AA257517, AA257668, AA471470, AA509251, AA509051 
AA990913, C39627, C41200, C42101, C46071, C48450, AA840970, AA842156 
AA430902, AA842216, AA007706, AA406765, AA180574, AA842310 AI067585 
AI077003, AA756933, AA114501, AA406690, AA454440, AA661371, AA056797' 
AA114372, AA180676, AI082955, AA406735, AA675746, AA752005 W29142 ' 
AA109261, AA842401, AA842602, AA246079, AI043420, AA626993' AA123655 
AA257716, AA406931, AA791379, AI105668, AA842538, R47669, AA088150 ' 
AA109308, AA738555, AA123638, AA180582, AA123597, AA471535 AA180728 
AA161741, AA186201, AA430817, AA433170, AA454407, AA471686' AA16159e' 
AA509110, AA675874, AA471540, Z26577, AA753093, AI066829, D37716 AA627017 
N99292, AA508936, AA257656, AA406768, AA471587, H39287, AA842431 AA161715 
AA509204, AA979935 Ji ' flAibl ' i:) ' 




AA896862, W56907, AA218305, AA020167, W85164, AA017810, W30604 AA541978 
W20894, AA023164, AA760425, AA024084, AA840044, , AA231755 T20644 C45833 
T24185, C68940, D37618, AA964657, AI011924 l^Ub44, C45833, 




R70900, R16693, AA906542, R35112, AA481286, D80100, AA235512 AA960995 
AA867982, D59403, W88874, AA558590, R10048, AA719917, H43573' R49500 ' 
AA744780, AA047136, AA789101, AA906332, AA747301, AA830606 AA434559' 
AA236698, AA328889, N98469, W17299, W46605, AA258082, AI078045 R11930 
W74577, AA703312, AI082727, R45189, N76461, AA010500, W63646 W38891 ' 
AA490651, AA558805, W87891, AA160849, AA618177, AA776126 AA161281 ' 
AA822308, AA499768, AA028780, AA183100, AA276783, AA120227 AA200285' 




SEQ ID NO. : 4, AL031178, Y11905, AF031904, M34309, M29366, L33953, L33952, 
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AA595663, AA827242, AA744475, , AA472485, AA097011 AA959170 
AA986669, AI006628, W12604, AI122356, AA571721 ^433^97 ISfisSt?? 
«Si«n?' AA116290 ' C87721, AA437983, AA990395 ^265925' 5?!?Ss W89262 
AA059703, AA413613, AI020231, AA667024, AA407526 AA221491 tlll'o^l ' 
C76436, AA409700, W35766, , AA859485, H33704, T465W C25899 T41 8 t 2 ' 

SSS, 6 : SX^SSSii: SSiS: SiSo7» 5 S' 7 ^ 5 i Br 3 ' 

C C7 6 ^?3 9 ; S? 7 9 S: ^J 4 6 4 5 25 :^^4^ 98393 ' C 2 377 5 ,^47^!^04^^ 5 ^9972, 

SEQ ID NO.: 5, U29344, S80437, M76767, X62889, M84761 J03S1 4 vcoco 
X13415, X13135, U05714, AC004013, U58675, S47635 Z81533 IcOOB^O 
AC003661, AB008567, Y13444, X96401, AF026487, AF026488 JSSiffS' 
AI037943, AI129371, C15883, AI073336, AA904077 AA56904 2 ' M039«l' 
AA568701, AA058907, AA911112, AA234022, N95359 AA565390 ^SHJ??' 
AA588430, D60358, AA045488, AA084417, N72089, AI14 3390 R51^7 4 aaor 1 4 T<3 
AA635907, T47621, AA579930, AA995057, AA827039, AA872490 ^5883?? 
AA069032, AA062768, AA534011, W96404, H06082, AA938900 AA9^1262 AA836S47 
AA251544, AA250742, AA830405, AA906492, AA102653 R38286 Ja258075 nltlol' 
H38522, R61787, R33742, R24094, AI052406, AA931452 R23634 Itlelo' H833 ° 2 ' 
AA974568, AI028383, AA312451, AA148800, AI138982, R36172 AA92 6 92l' 
AA472563, AA914598, AA423256, AA476186 AA041992 W8493s' Mllllll' ' 
™%n¥&- AA544361 ' AI006075, AA717202 AA008602 ^033399^^72306 
AI037430, AI152943, AA475582, AA471742, AA530292 AA277496 AA718588' 
AA823112, AA822771, AA710973, AA981780, AA718846 AA797557 AM ^7 21' 
AA543891 ' AA606513, AA981758 AA822268 AA41316i; ^47220^' 
AA646527, AA213036, AA168254, AA718549, AA458342 AA119418 AA87992S 
AA050917, AA718814, AA793638, AA177849 AA575627 AA537367' ^llllll' 
AA203946, AA413157, AA517432, AA867736, AI037420 W30436 ^1851 

^957229: £1111*1' AA955881 - AA963915 - AI043663: c ^ 6i: '^llltl: 

SEQ ID NO.: 6, AC002426, AC004674, AC004675, AC004006 L12157 U20fi^Q 
U46028, U20835, AC004058, X67115, X76266, M25485 U59759 AC005172 II7084R 
nn?l 2 ^ 4 ' X14724 ' L3184 °' L41679, , W52480, AA863014 KS6110 1a286755 ' 
W52777 ' ^814246, AA765427, AA873647, AA36057? AlbS 

MBlll FO^f 5 A^7^ n 686 ^;^ 164603 ' W56724 ' AA194905; AA865009: R16884, 
R08110, F07672, AA577790, T11467, AA502489, AA366688, AA480628 AA51631R 
AA090005, AA081908, F01265, AA780686, , AA466811, AA4 65808 AA1 97^4 6 
AI157174, AA153086, AA197667, AA066612 AA058086 AA024186 ^10104 
AA880419, AA870161, AA153880, AA072995, , AI031052 AI043934 ^8^9320 
D15181, U95093, AI096188, T09655, C62926 AA996434 AA925202' Ullllll' 
i^lllt 1 !' R0366 °' AA94898 °' AA"8709, AA899446, AA95?T50 d'sO^SO ^76196, 

SEQ ID NO.: 1, X63547, X63546, AF017306, U44839, U20657 AF017305 
AJ001589, L21998, D21270, Z48245, U63834, Z68006. L04573, 0638^9 AC005266 
AF025468, M94131, , AI056961, C15588, AA373847, AA887911 N87070 R39133 
AA226825, AA353972, AA325352, R88378, AA490675 AA456219 AA^oIJis 
AA046611, AA456224, , AA065652, AA221447, AA710221, AA7174?? AA254049 
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AA041745, AA510261, AA170745, AA656404, AA572506 AA174539 fl wn« n 

AI134284, C58457, C11202, AA413362, AA728532 AA174 539, AA217630, , 




T75882, AA848187, T44109, AA542686, H21339 



SEQ ID NO 
D13438 
AA3818 
AA16675 




SEQ ID NO.: 11, X76498, Z97632, AC004232, AC005184, AC002545, AC003991. 



EQUIVALENTS 

Those skilled in the art will recognize, or be able to ascertain using no more than 
routine experimentation, many equivalents to the specific embodiments of the invention 
described herein. Such equivalents are intended to be encompassed by the following claims. 

All references disclosed herein are incorporated by reference in their entirety. 

We claim: 
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1 - A method of diagnosing a disorder characterized by expression of a human 



Claims 

u,, n , cancer 



associated antigen precursor coded for by a nucleic acid molecule, comprising: 

contacting a biological sample isolated from a subject with an agent that specifically 

binds to the nucleic acid molecule, an expression product thereof, or a fragment of an 

expression product thereof complexed with an HLA molecule, wherein the nucleic acid 

molecule is a NA Group 1 nucleic acid molecule, and 

determining the interaction between the agent and the nucleic acid molecule or the 

expression product as a determination of the disorder. 

2. The method of claim 1, wherein the agent is selected from the group consisting of 

(a) a nucleic acid molecule comprising NA group 1 nucleic acid molecules or a 
fragment thereof, 

(b) a nucleic acid molecule comprising NA group 3 nucleic acid molecules or a 
fragment thereof, 

(c) a nucleic acid molecule comprising NA group 5 nucleic acid molecules or a 
fragment thereof, 

(d) an antibody that binds to an expression product of NA group 1 nucleic acids, 

(e) an antibody that binds to an expression product of NA group 3 nucleic acids, 

(f) an antibody that binds to an expression product of NA group 5 nucleic acids, 

(g) an agent that binds to a complex of an HLA molecule and a fragment of an 
expression product of a NA group 1 nucleic acid, 

(h) an agent that binds to a complex of an HLA molecule and a fragment of an 
expression product of a NA group 3 nucleic acid, and 

(i) an agent that binds to a complex of an HLA molecule and a fragment of an 
expression product of a NA group 5 nucleic acid. 

3. The method of claim 1 , wherein the disorder is characterized by expression of a 
plurality of human cancer associated antigen precursors and wherein the agent is a plurality of 
agents, each of which is specific for a different human cancer associated antigen precursor, 
and wherein said plurality of agents is at least 2, at least 3, at least 4, at least 4, at least 6, at 
least 7, or at least 8, at least 9 or at least 1 0 such agents. 
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4. The method of claims 1-3, wherein the agent is specific for a human cancer associated 
antigen precursor that is selected from the group consisting of breast, gastric, lung, prostate, 
renal, colon, thyroid, Hodgkin's disease, and hepatocarcinoma cancer associated antigen 
precursors. 

5. A method for determining regression, progression or onset of a condition characterized 
by expression of abnormal levels of a protein encoded by a nucleic acid molecule that is a NA 
Group 1 molecule, comprising 

monitoring a sample, from a patient who has or is suspected of having the condition, 
for a parameter selected from the group consisting of 

(i) the protein, 

(ii) a peptide derived from the protein, 

(iii) an antibody which selectively binds the protein or peptide, and 

(iv) cytolytic T cells specific for a complex of the peptide derived from the 
protein and an MHC molecule, 

as a determination of regression, progression or onset of said condition. 

6. The method of claim 5, wherein the sample is a body fluid, a body effusion or a tissue. 

7. The method of claim 5, wherein the step of monitoring comprises contacting the 
sample with a detectable agent selected from the group consisting of 

(a) an antibody which selectively binds the protein of (i), or the peptide of (ii), 

(b) a protein or peptide which binds the antibody of (iii), and 

(c) a cell which presents the complex of the peptide and MHC molecule of (iv). 

8. The method of claim 7, wherein the antibody, the protein, the peptide or the cell is 
labeled with a radioactive label or an enzyme. 

9. The method of claim 5, comprising assaying the sample for the peptide. 

10. The method of claim 5, wherein the nucleic acid molecule is a NA Group 3 molecule. 
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11. The method of claim 5, wherein the nucleic acid molecule is a NA Group 5 molecule. 

12. The method of claim 5, wherein the protein is a plurality of proteins, the parameter is a 
plurality of parameters, each of the plurality of parameters being specific for a different of the 
plurality of proteins, at least one of which is a cancer associated protein encoded by a NA 
Group 1 molecule. 

13. The method of claim 5, wherein the protein is a plurality of proteins, at least one of 
which is kinectin, the remainder of which are non-kinectin cancer associated proteins, and 
wherein the parameter is a plurality of parameters, each of the plurality of parameters being 
specific for a different of the plurality of proteins. 

14. A pharmaceutical preparation for a human subject comprising 

an agent which when administered to the subject enriches selectively the presence of 
complexes of an HLA molecule and a human cancer associated antigen, and 

a pharmaceutical^ acceptable carrier, wherein the human cancer associated antigen is 
a fragment of a human cancer associated antigen precursor encoded by a nucleic acid 
molecule comprises a NA Group 1 molecule. 

1 5. The pharmaceutical preparation of claim 14, wherein the agent comprises a plurality of 
agents, each of which enriches selectively in the subject complexes of an HLA molecule and a 
different human cancer associated antigen, wherein at least one of the human cancer 
associated antigens is encoded by a NA Group 1 molecule. 

16. The pharmaceutical preparation of claim 15, wherein the plurality is at least two, at 
least three, at least four or at least 5 different such agents. 

17. The pharmaceutical preparation of claim 14, wherein the nucleic acid molecule is a 
NA Group 3 nucleic acid molecule. 

1 8. The pharmaceutical preparation of claim 14, wherein the agent comprises a plurality of 
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agents, at least one of which is kinectin, the remainder of which are non-kinectin cancer 
associated proteins, and each of which enriches selectively in the subject complexes of an 
HLA molecule and a different human cancer associated antigen. 

1 9. The pharmaceutical preparation of claim 1 4, wherein the agent is selected from the 
group consisting of 

(1) an isolated polypeptide comprising the human cancer associated antigen, or a 
functional variant thereof, 

(2) an isolated nucleic acid operably linked to a promoter for expressing the isolated 
polypeptide, or functional variant thereof, 

(3) a host cell expressing the isolated polypeptide, or functional variant thereof, and 

(4) isolated complexes of the polypeptide, or functional variant thereof, and an HLA 
molecule. 

20. The pharmaceutical preparation of claims 14-19, further comprising an adjuvant. 

21. The pharmaceutical preparation of claim 1 4, wherein the agent is a cell expressing an 
isolated polypeptide comprising the human cancer associated antigen or a functional variant 
thereof, and wherein the cell is nonproliferative. 

22. The pharmaceutical preparation of claim 1 4, wherein the agent is a cell expressing an 
isolated polypeptide comprising the human cancer associated antigen or a functional variant 
thereof, and wherein the cell expresses an HLA molecule that binds the polypeptide. 

23. The pharmaceutical preparation of claim 21 or 22, wherein the isolated polypeptide 
comprises a kinectin polypeptide. 

24. The pharmaceutical preparation of claim 14, wherein the agent is at least two, at least 
three, at least four or at least five different polypeptides, each coding for a different human 
cancer associated antigen or functional variant thereof, wherein at least one of the human 
cancer associated antigens is encoded by a NA Group 1 molecule. 
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25. The pharmaceutical preparation of claim 24, wherein the at least one of the human 
cancer associated antigens is kinectin or a fragment thereof. 

26. The pharmaceutical preparation of claim 1 4, wherein the agent is a PP Group 2 
polypeptide. 

27. The pharmaceutical preparation of claim 1 4, wherein the agent is a PP Group 3 
polypeptide or a PP Group 4 polypeptide. 

28. The pharmaceutical preparation of claim 22, wherein the cell expresses one or both of 
the polypeptide and HLA molecule recombinantly. 

29. The pharmaceutical preparation of claim 22, wherein the cell is nonproliferative. 

30. A composition comprising 

an isolated agent that binds selectively a PP Group 1 polypeptide. 

31. The composition of matter of claim 30, wherein the agent binds selectively a PP Group 

2 polypeptide. 

32. The composition of matter of claim 30, wherein the agent binds selectively a PP Group 

3 polypeptide. 

33. The composition of matter of claim 30, wherein the agent binds selectively a PP Group 

4 polypeptide. 

34. The composition of matter of claim 30, wherein the agent binds selectively a PP Group 

5 polypeptide. 

35. The composition of claims 30-34, wherein the agent is a plurality of different agents 
that bind selectively at least two, at least three, at least four, or at least five different such 
polypeptides. 



BNSDOCID: <WO_0020587A2_I_> 



WO 00/20587 PCT/US99/22873 

- 67 - 



36. The composition of claim 35, wherein the at least one of polypeptides is kinectin or a 
fragment thereof. 

37. The composition of claims 30-34, wherein the agent is an antibody. 

38. The composition of claim 3 1 5 wherein the agent is an antibody. 

39. A composition of matter comprising 

a conjugate of the agent of claims 30-34 and a therapeutic or diagnostic agent. 

40. A composition of matter comprising 

a conjugate of the agent of claim 35 and a therapeutic or diagnostic agent. 

41 . The composition of matter of claim 39, wherein the conjugate is of the agent and a 
therapeutic or diagnostic that is a toxin. 

42. A pharmaceutical composition comprising an isolated nucleic acid molecule selected 
from the group consisting of NA Group 1 molecules and NA Group 2 molecules, and a 
pharmaceutical ly acceptable carrier. 

43. The pharmaceutical composition of claim 42, wherein the isolated nucleic acid 
molecule comprises a NA Group 3 or NA Group 4 molecule. 

44. The pharmaceutical composition of claim 42, wherein the isolated nucleic acid 
molecule comprises at least two isolated nucleic acid molecules coding for two different 
polypeptides, each polypeptide comprising a different human cancer associated antigen. 

45. The pharmaceutical composition of claim 44, wherein at least one of the polypeptides 
is a kinectin polypeptide. 

46. The pharmaceutical composition of claims 42-45 further comprising an expression 
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vector with a promoter operably linked to the isolated nucleic acid molecule. 

47. The pharmaceutical composition of claims 42-45 further comprising a host cell 
recombinantly expressing the isolated nucleic acid molecule. 

48. A pharmaceutical composition comprising 

an isolated polypeptide comprising a PP Group 1 or a PP Group 2 polypeptide, and 
a pharmaceutically acceptable carrier. 

49. The pharmaceutical composition of claim 48, wherein the isolated polypeptide 
comprises a PP Group 3 or a PP Group 4 polypeptide. 

50. The pharmaceutical composition of claim 48, wherein the isolated polypeptide 
comprises at least two different polypeptides, each comprising a different human cancer 
associated antigen. 

51. The pharmaceutical composition of claim 50, wherein at least one human cancer 
associated antigen is kinectin. 

52. The pharmaceutical composition of claim 48, wherein the isolated polypeptides are PP 
Group 1 1 polypeptides or HLA binding fragments thereof. 

53. The pharmaceutical composition of claim 48, wherein the isolated polypeptides are PP 
Group 12 polypeptides or HLA binding fragments thereof. 

54. The pharmaceutical composition of claims 48-53, further comprising an adjuvant. 

55. An isolated nucleic acid molecule comprising a NA Group 3 molecule. 

56. An isolated nucleic acid molecule comprising a N A Group 4 molecule. 

57. An isolated nucleic acid molecule selected from the group consisting of 
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.(a) a fragment of a nucleic acid molecule having a nucleotide sequence selected from 
the group consisting of nucleotide sequences set forth as SEQ ID NOs. 1-1 1 and 22-46, of 
sufficient length to represent a sequence unique within the human genome, and identifying a 
nucleic acid encoding a human cancer associated antigen precursor, 

(b) complements of (a), 

provided that the fragment includes a sequence of contiguous nucleotides which is not 
identical to any sequence selected from the sequence group consisting of 

(1) sequences having the GenBank accession numbers of Table 1, 

(2) complements of (1), and 

(3) fragments of (1) and (2). 

58. The isolated nucleic acid molecule of claim 50, wherein the sequence of contiguous 
nucleotides is selected from the group consisting of: 

( 1 ) at least two contiguous nucleotides nonidentical to the sequence group, 

(2) at least three contiguous nucleotides nonidentical to the sequence group, 

(3) at least four contiguous nucleotides nonidentical to the sequence group, 

(4) at least five contiguous nucleotides nonidentical to the sequence group, 

(5) at least six contiguous nucleotides nonidentical to the sequence group, 

(6) at least seven contiguous nucleotides nonidentical to the sequence group. 

59. The isolated nucleic acid molecule of claim 57, wherein the fragment has a size 
selected from the group consisting of at least: 8 nucleotides, 10 nucleotides, 12 nucleotides, 14 
nucleotides, 16 nucleotides, 1 8 nucleotides, 20, nucleotides, 22 nucleotides, 24 nucleotides, 26 
nucleotides, 28 nucleotides, 30 nucleotides, 50 nucleotides, 75 nucleotides, 100 nucleotides, 
and 200 nucleotides. 

60. The isolated nucleic acid molecule of claim 57, wherein the molecule encodes a 
polypeptide which, or a fragment of which, binds a human HLA receptor or a human 
antibody. 

61 . An expression vector comprising an isolated nucleic acid molecule of any of claims 
55-60 operably linked to a promoter. 
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62. An expression vector comprising a nucleic acid operably linked to a promoter, wherein 
the nucleic acid is a NA Group 2 molecule. 

63. An expression vector comprising a NA Group 1 or Group 2 molecule and a nucleic 
acid encoding an HLA molecule. 

64. A host cell transformed or transfected with an expression vector of claim 61 . 

65. A host cell transformed or transfected with an expression vector of claims 62 or 63. 

66. A host cell transformed or transfected with an expression vector of claim 61 and 
further comprising a nucleic acid encoding HLA. 

67. A host cell transformed or transfected with an expression vector of claim 62 and 
further comprising a nucleic acid encoding HLA. 

68. An isolated polypeptide encoded by the isolated nucleic acid molecule of claim 55 or 
claim 56. 

69. A fragment of the polypeptide of claim 68 which is immunogenic. 

70. An isolated polypeptide comprising a fragment of a kinectin polypeptide which is 
immunogenic. 

71 . The fragment of claims 69 or 70, wherein the fragment, or a portion of the fragment, 
binds HLA or a human antibody. 

72. An isolated fragment of a human cancer associated antigen precursor which, or portion 
of which, binds HLA or a human antibody, wherein the precursor is encoded by a nucleic acid 
molecule that is a NA Group 1 molecule. 
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73 . The fragment of claim 72, wherein the fragment is part of a complex with HL A. 

74. The fragment of claim 73, wherein the fragment is between 8 and 12 amino acids in 
length. 



75. An isolated polypeptide comprising a fragment of the polypeptide of claim 68 of 
sufficient length to represent a sequence unique within the human genome and identifying a 
polypeptide that is a human cancer associated antigen precursor. 

76. A kit for detecting the presence of the expression of a human cancer associated antigen 
precursor comprising 

a pair of isolated nucleic acid molecules each of which consists essentially of a 
molecule selected from the group consisting of (a) a 12-32 nucleotide contiguous segment of 
the nucleotide sequence of any of the NA Group 1 molecules and (b) complements of ("a"), 
wherein the contiguous segments are nonoverlapping. 

77. The kit of claim 76, wherein the pair of isolated nucleic acid molecules is constructed 
and arranged to selectively amplify an isolated nucleic acid molecule that is a NA Group 3 
molecule. 

78. A method for treating a subject with a disorder characterized by expression of a human 
cancer associated antigen precursor, comprising 

administering to the subject an amount of an agent, which enriches selectively in the 
subject the presence of complexes of an HLA molecule and a human cancer associated 
antigen, effective to ameliorate the disorder, wherein the human cancer associated antigen is a 
fragment of a human cancer associated antigen precursor encoded by a nucleic acid molecule 
selected from the group consisting of 

(a) a nucleic acid molecule comprising NA group 1 nucleic acid molecules, 

(b) a nucleic acid molecule comprising NA group 3 nucleic acid molecules, 

(c) a nucleic acid molecule comprising NA group 5 nucleic acid molecules. 

79. The method of claim 78, wherein the disorder is characterized by expression of a 
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plurality of human cancer associated antigen precursors and wherein the agent is a plurality of 
agents, each of which enriches selectively in the subject the presence of complexes of an HLA 
molecule and a different human cancer associated antigen, wherein at least one of the human 
cancer associated antigens is encoded by aNA Group 1 molecule. 

80. The method of claim 79, wherein at least one of the human cancer associated antigens 
is kinectin or a fragment thereof. 

81. The method of claim 79, wherein the plurality is at least 2, at least 3 , at least 4, or at 
least 5 such agents. 



82. The method of claims 78-8 1 , wherein the agent is an isolated polypeptide selected 
from the group consisting of PP Group 1, PP Group 2, PP Group 3, PP Group 4, and PP 
Group 5. 

83 . The method of claims 78-8 1 , wherein the disorder is cancer. 

84. The method of claims 82, wherein the disorder is cancer. 

85. A method for treating a subject having a condition characterized by expression of a 
human cancer associated antigen precursor in cells of the subject, comprising: 

(i) removing an immunoreactive cell containing sample from the subject, 

(ii) contacting the immunoreactive cell containing sample to the host cell under 
conditions favoring production of cytolytic T cells against a human cancer associated antigen 
which is a fragment of the precursor, 

(iii) introducing the cytolytic T cells to the subject in an amount effective to lyse 
cells which express the human cancer associated antigen, wherein the host cell is transformed 
or transfected with an expression vector comprising an isolated nucleic acid molecule 
operably linked to a promoter, the isolated nucleic acid molecule being selected from the 
group of nucleic acid molecules consisting of NA Group 1 , NA Group 2, NA Group 3, NA 
Group 4, and NA Group 5. 



BNSDOCID: <WO 0020S87A2 I > 



WO 00/20587 PCT/US99/22873 

-73 - 

86. The method of claim 85, wherein the host cell recombinant^ expresses an HLA 
molecule which binds the human cancer associated antigen. 

87. The method of claim 85, wherein the host cell endogenously expresses an HLA 
molecule which binds the human cancer associated antigen. 

88. A method for treating a subject having a condition characterized by expression of a 
human cancer associated antigen precursor in cells of the subject, comprising: 

(i) identifying a nucleic acid molecule expressed by the cells associated with said 
condition, wherein said nucleic acid molecule is a NA Group 1 molecule; 

(ii) transfecting a host cell with a nucleic acid selected from the group consisting 
of (a) the nucleic acid molecule identified, (b) a fragment of the nucleic acid identified which 
includes a segment coding for a human cancer associated antigen, (c) deletions, substitutions 
or additions to (a) or (b), and (d) degenerates of (a), (b), or (c); 

(iii) culturing said transfected host cells to express the transfected nucleic acid 
molecule, and; 

(iv) introducing an amount of said host cells or an extract thereof to the subject 
effective to increase an immune response against the cells of the subject associated with the 
condition. 

89. The method of claim 88, wherein the nucleic acid molecule is a kinectin nucleic acid 
molecule. 

90. The method of claim 88, further comprising identifying an MHC molecule which 
presents a portion of an expression product of the nucleic acid molecule, wherein the host cell 
expresses the same MHC molecule as identified and wherein the host cell presents an MHC 
binding portion of the expression product of the nucleic acid molecule. 

91 . The method of claim 88, wherein the immune response comprises a B-cell response or 
a T cell response. 

92. The method of claim 91, wherein the response is a T-cell response which comprises 
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generation of cytolytic T-cells specific for the host cells presenting the portion of the 
expression product of the nucleic acid molecule or cells of the subject expressing the human 
cancer associated antigen. 

93. The method of claim 88, wherein the nucleic acid molecule is a NA Group 3 molecule. 



94. The method of claims 88 or 90, further comprising treating the host cells to render 
them non-proliferative. 

95. A method for treating or diagnosing or monitoring a subject having a condition 
characterized by expression of an abnormal amount of a protein encoded by a nucleic acid 
molecule that is aNA Group 1 molecule, comprising 

administering to the subject an antibody which specifically binds to the protein or a 
peptide derived therefrom, the antibody being coupled to a therapeutically useful agent, in an 
amount effective to treat the condition. 

96. The method of claim 95, wherein the antibody is a monoclonal antibody. 

97. The method of claim 96, wherein the monoclonal antibody is a chimeric antibody or a 
humanized antibody. 

98. A method for treating a condition characterized by expression in a subject of abnormal 
amounts of a protein encoded by a nucleic acid molecule that is a NA Group 1 nucleic acid 
molecule, comprising 

administering to a subject a pharmaceutical composition of any one of claims 14-29 
and 42-547 in an amount effective to prevent, delay the onset of, or inhibit the condition in the 
subject. 

99. The method of claim 98, wherein the condition is cancer. 

1 00. The method of claim 98, further comprising first identifying that the subject expresses 
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in a tissue abnormal amounts of the protein. 

101. The method of claim 99, further comprising first identifying that the subject expresses 
in a tissue abnormal amounts of the protein. 



1 02. A method for treating a subject having a condition characterized by expression of 
abnormal amounts of a protein encoded by a nucleic acid molecule that is a NA Group 1 
nucleic acid molecule, comprising 

(i) identifying cells from the subject which express abnormal amounts of the protein; 

(ii) isolating a sample of the cells; 

(iii) cultivating the cells, and 

(iv) introducing the cells to the subject in an amount effective to provoke an immune 
response against the cells. 

103. The method of claim 102, further comprising rendering the cells non-proliferative, 
prior to introducing them to the subject. 

1 04. A method for treating a pathological cell condition characterized by aberrant 
expression of a protein encoded by a nucleic acid molecule that is a NA Group 1 nucleic acid 
molecule, comprising 

administering to a subject in need thereof an effective amount of an agent which 
inhibits the expression or activity of the protein. 

1 05 . The method of claim 1 04, wherein the agent is an inhibiting antibody which 
selectively binds to the protein and wherein the antibody is a monoclonal antibody, a chimeric 
antibody, a humanized antibody or an antibody fragment. 

1 06. The method of claim 1 04, wherein the agent is an antisense nucleic acid molecule 
which selectively binds to the nucleic acid molecule which encodes the protein. 

1 07. The method of claim 1 04, wherein the nucleic acid molecule is a NA Group 3 nucleic 
acid molecule. 
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108. The method of claim 104, wherein the nucleic acid molecule is a kinectin nucleic acid 
molecule. 

1 09. A composition of matter useful in stimulating an immune response to a plurality of a 
proteins encoded by nucleic acid molecules that are NA Group 1 molecules, comprising 

a plurality of peptides derived from the amino acid sequences of the proteins, wherein 
the peptides bind to one or more MHC molecules presented on the surface of the cells which 
express an abnormal amount of the protein. 

110. The composition of matter of claim 109, wherein at least a portion of the plurality of 
peptides bind to MHC molecules and elicit a cytolytic response thereto. 

111. The composition of matter of claim 109, wherein at least one of the proteins is 
kinectin. 



1 12. The composition of matter of claim 1 1 0, further comprising an adjuvant. 

113. The composition of matter of claim 112, wherein said adjuvant is a saponin, GM-CSF, 
or an interleukin. 



1 14. The composition of matter of claim 109, further comprising at least one peptide useful 
in stimulating an immune response to at least one protein which is not encoded by nucleic 
acid molecules that are NA Group 1 molecules, wherein the at least one peptide binds to one 
or more MHC molecules. 



115. An isolated antibody which selectively binds to a complex of: 

(i) a peptide derived from a protein encoded by a nucleic acid molecule that is a 
NA Group 1 molecule and 

(ii) and an MHC molecule to which binds the peptide to form the complex, 
wherein the isolated antibody does not bind to (i) or (ii) alone. 
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116. The antibody of claim 1 1 5, wherein the antibody is a monoclonal antibody, a chimeric 
antibody, a humanized antibody, or a fragment thereof. 
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SEQUENCE LISTING 

<110> Ludwig Institute for Cancer Research 

5 <120> Cancer Associated Antigens 

and Uses Therefor 

<130> L0461/7050WO 

10 <140> US 09/166,300 

<141> 1997-10-05 

<140> US 09/166,350 

<141> 1997-10-05 

15 

<160> 45 

<170> FastSEQ for Windows Version 3.0 

20 <210> 1 

<211> 4422 

<212> DNA 

<213> Homo sapiens 



25 <400> 1 

gattattctg gaggaagatg accttcattt tatggttatg tgtgtgtgtc tgtgtgtctt 60 

tggaaaaata tatataattt tttcaaatag gaagccaaca tatcaagtga tgaattaaag 120 

tatgctgcag aatatatatt ctaaaactac aaaaaagtca ctgaatatca aaatgataca 180 

gcttatacat atagttactg tgacaagtga cagactgcta gttcagaatt caaaaatcct 240 

30 ttcctagttt gtgagataat gggctaaatt ccttctgcct gccactgggg caaagcaaat 300 

tgctttagtt tttgatgaga gttcttagaa gtttgttggt attccttcat ccacagcatc 360 

cattgttgaa ataaccattt tcagttgtga tgccttaact aagaagccaa ttgttagcct 420 

gaaatgcaat cttggtagcc agtttcaatg aagctagaga ttagtcagaa aaagttagct 4 80 

gttgggcttt agaaagggat tttgagtcct gncatttcta cttgggagca ttttggagca 540 

35 gattaagctt tcagtataaa aacaagtggc tacctgatgg aaacttttct tacccttata 600 

gggaaactga gcacaagctg aatgatattg tctgctgcaa aaaaaaacaa acaaaaaaaa 660 

aacaaaacaa aaaacaaaaa aaaaaaaaaa aaaacctcgt gccgaattcg gcacgagggg 720 

aagccccgtg caccccccgc cctccggccg ccgccgcccc gctggccctg cagccgtcgc 780 

cgctgcctcg ggctacagcc ccgggctcgg cggtcccggc tggggaagga gggcggcgag 84 0 

40 cgcgtccgga gccgccggag atggcgggag ggcactgcgg cagcttcccc gcggcggcgg 900 

ccggcagcgg cgagatcgtc caactgaacg taggggggac cagatttagt acctcaagac 960 

aaactcttat gtggattcca gattcttttt tttccagttt gctgagtggg agaatttcaa 1020 

cacttcgaga tgaaactggt gctatattta ttgatagaga tccagcagca tttgcaccca 1080 

ttttaaattt tcttcggaca aaagaactag acttaagggg agtgagtatt aatgttctca 1140 

45 ggcatgaagc agaattttac gggatcactc cattagtaag aaggcttctc ttatgtgaag 1200 

aattggagcg ttcctcttgt ggcagtgtcc tttttcatgg ttacttgccc ccaccaggta 1260 

ttcctagtcg taaaataaac aacacagtca gatctgctga ttctaggaat ggtctaaatt 1320 

ctacagaagg tgaagcccgg ggaaatggta cacagcctgt tctctctgga acgggagaag 1380 

aaactgttag gctaggattt cctgtggatc cacgaaaggt gctaatagta gctggccatc 14 40 

50 acaactggat tgtagctgca tatgcccatt ttgctgngtg gtacagaatc aaagaatctt 1500 

caggatggca gcaagtgttt acgagcccat atttggattg gactatcgaa cgagtagctt 1560 

taaatgcaaa ggtggttgga gggccacatg gagacaaaga caaaatggtt gctgttgcct 1620 

cagagagtag catcatcttg tggagtgttc aggatggggg aagtggaagt gaaattggag 1680 

tgttcagcct gggtgttcct gtagatgctc tcttctttat tggtaaccag ttggtggcca 1740 

55 cgagtcatac agggaaagtg ggagtgtgga atgctgtcac tcagcactgg caggttcaag 1800 

atgttgttcc tataactagt tatgacactg ctggatcatt ccttctgctt ggatgtaaca 1860 

atggatcaat atattacata gatatgcaga agttcccctt gcgaatgaaa gataatgatc 1920 

ttcttgtaac tgaactgtat catgatcctt caaatgatgc tattactgct ctgagtgttt 1980 

acctcacacc caaaacaagt gtcagtggta actggatcga gatcgcctat ggtacgagct 2040 

60 ctggagcagt acgagtgatt gtacaacacc cagagacagt tgggtcaggt cctcagcttt 2100 

ttcagacttt cacagttcac cgaagtcccg taacaaaaat catgctatca gagaagcatc 2160 

ttgtatcagt ctgtgcagat aataatcatg tccggacgtg gacagtaaca cgattcngag 2220 

gaatgatctc tactcagcca ggttctactc ctttagcgtc attcaagata ctatccctgg 2280 

aggagacaga aagtcatggt agctattcct ctggaaatga cataggacct tttggagagc 2340 

65 gagacgatca acaggtgttt atccagaaag ttgttcccat caccaacaaa ctatttgtaa 2400 

gactctcatc gactggaaaa agaatatgtg agatccaggc tgttgactgt actacaatat 24 60 

cctcatttac agtgagggaa tgtgagggat ccagtaggat gggctcaaga ccaaggcgct 2520 
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10 



15 



20 



25 



30 



acttgttcac 
atatggttaa 
aattactcga 
caacttccgt 
accatgatac 
ctttattagc 
ttaatttgaa 
aagctgaagt 
tagaaataaa 
gtttttcaga 
ggaagaaagg 
catcatcacc 
ctacaaagac 
gcttgtgaaa 
tttactgaat 
catttagtat 
ttgacatttt 
ggagtccatt 
taggatggtg 
cctaatcttc 
ttgtctttgg 
tgatgaggtg 
aataccagat 
ttttctgatt 
ccaaaattcc 
aaaataataa 
acctgtcttg 
ttaaattaaa 
gttttggctg 
gtttttgtac 
aattgtaaat 
aaatataagt 



aggccataca 
caaaagtgaa 
tcaatgtgat 
agttcagcat 
cacccatgaa 
aagggcaaga 
cagaaatgta 
gaaaggggca 
aagtttgaga 
atccaagaaa 
aggatttgaa 
aagtacttcc 
tactccatct 
actcaccaaa 
ttcagtacat 
ctttttaaca 
ggaaattttt 
ttaacactta 
cccatatagg 
attatgcctg 
aacttggtct 
ctatgtttgt 
accactatgt 
ctcatcattg 
attgaaactt 
tcaagggcaa 
tttcaggagc 
acnctagttt 
ctgtaagaat 
aatttttttt 
ttaaagcatt 
gagtgaagtt 



aatggcagta 
gataaggatg 
ttgagcacat 
agccacttac 
gcagctactt 
aggactgaga 
gaaagagctg 
acaggggaat 
gaattggata 
aggtcatcag 
gggggaggat 
gatggaggaa 
cctcggcata 
atgaatagtt 
tagtttttac 
gaattacttg 
ttaattttac 
ccgacttttt 
tgagcatccc 
agacttgtct 
cccaacactt 
gaaaaacata 
agtaagtctt 
ggagatctta 
tcaagttgga 
agctttgagt 
caccatgttt 
gttcacttgt 
gtgaatttga 
catttaattg 
agcatttatt 
gaaataaaat 



ttcaaatgtg 

taggtggtcc 

ctcgctgtgc 

gagaatcaaa 

acggttccat 

gctttcacag 

tccctgaaaa 

gtaatatatc 

gtggattgga 

aagatgaaaa 

tccttggaag 

ctgactcacc 

aaaaaagtga 

gtttcggtta 

actaaaactt 

gaataatgag 

aagtacattt 

ttggtttgga 

tttagatcat 

tacaatgtta 

attgtgattg 

tcatgtaatt 

ttaggatgat 

aatcttagca 

gcaattgtct 

gcccagaagg 

ttttttcagt 

aggactgcag 

atgtattttg 

gaatgatctt 

ggtgaataat 

ctttaaaatt 



ggatctgacc 
aaccgaagaa 
tactcctaac 
ttctagcctt 
gaggccttac 
ttatagggac 
tggtaacttg 
tgagagaaag 
agtgcataaa 
tgaaaataaa 
aaagaaagtt 
tggtactgcg 
ttcttcaggt 
cat ttagatg 
tacaagataa 
atacaataat 
aacagatcat 
aacatattac 
gggaaccagc 
cctttaagtg 
caaagtgttt 
caaaaacact 
tttaatttag 
agcattagca 
gtgtttgaaa 
gaaagctgta 
gttancaaca 
ttctgaattt 
aattgtaaga 
caggtttcta 
gtatatatcc 
ta 



actgctatgg 
gagctactca 
atcagtccag 
cagcttcagc 
agagaaagtc 
ttccagacta 
ggtccaatac 
tctcctggag 
atagctgaag 
atagagttta 
ccctatctgg 
tccccatctc 
caggagtaca 
aaagttaaac 
aattggactt 
catatctctt 
ttataaagca 
cacgtcttaa 
agactgcatt 
aatcacataa 
accagatatt 
attgatattg 
tcgtgcgtca 
atattaaatg 
agatgaaata 
ccagttgcta 
atcatgataa 
tgggttaaag 
gcaaaagaac 
caaatagggt 
ccattccaag 



2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4422 



35 



<210> 2 

<211> 502 

<212> DNA 

<213> Homo sapiens 



40 



45 



50 



55 



60 



65 



<400 
gctagcttcg 
cctgttcttt 
tgctttcctc 
gccatttacc 
caaaatacca 
gttggataag 
ctcaagcagt 
attgcttggc 
atgtgtttct 



> 2 
gcgcggatcc 
gcgccagcgg 
aaaatgagtg 
cttcatagga 
gctgttactt 
tggaaaacta 
aaggatgaag 
agagaagtac 
aacacagcaa 



ctgggcgtcc 
gaaccactat 
ttagtgtcaa 
agagaaataa 
atcctaaaaa 
ccatgaaatc 
atcctctagc 
cagaacacat 
aa 



<210> 3 

<211> 1948 

<212> DNA 

<213> Homo sapiens 



<400> 
ccatgtgagg 
ttcctccagc 
gttgtgaccg 
tttttcccag 
caagattacc 
aaggttgctg 
aaccttggac 
agcgagaaga 
agagaagccc 
aagcccaggc 
tctccccgtt 
caggcgggcc 



gtacgtcgga 
ctctgcactc 
tttcttcaga 
cttaacaatt 
tgagagtaca 
tagtgtgcaa 
tgccaccaga 
cactgaagaa 



gaggggcccg 
agtgtgaccg 
tgtatgagtt 
ctatatataa 
tcagtagttc 
agtcattagg 
ccaagcacct 
gggaacgtga 
tgtccaacga 
ccttgcatgc 
ctgaagaaag 
cgcagcttca 



ctcctgcttg 
ggaggatctg 
tcttctgatg 
ggaatttgaa 
tggtctgtgt 
aatcacagaa 
cagccgagac 
ggctgcggag 
taccactgaa 
tttgcccgct 
ccatacaacg 
ggcactggct 



gtgacagagt 
gtggaattgg 
aaggttgaaa 
gagttgcata 
tcccaggaga 
ttcctacgga 
atggatgggg 
gagggactgg 
tctcttgctg 
ggtttttccc 
acggtttctg 
aacttagaag 



gtccttcgtc 
ctggggtttt 
cctttcacca 
ttgcagagat 
cgcccttctg 
gaagaatgtg 
gagttcattg 
gagctcaaaa 



ctccagggtc 
gttacatggc 
ggtttttggt 
acatgtcttc 
aagagctaga 
tttcaacaat 
agatgtggag 
cccttatgga 



cagcacgcgg 
ctctgcctca 
aagatcatct 
aaatggttaa 
ccctggaaat 
agaaagaaat 
agcagctaga 
cctcagtgaa 
ccaacagcag 
ctccagtaaa 
gtggcaatgg 
ccaggagggg 



tggcctgcag 
gctggctcag 
agcaaagcct 
gaaaatgtgc 
aaacaatgat 
acacccagac 
gggagctagc 
aaggcccaga 
aggccgggag 
tgtgactgtc 
gagcgtgttc 
gtctataggt 



60 
120 
180 
240 
300 
360 
420 
480 
502 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
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15 



20 



gctgctctct 
aggaggctga 
tcagaccagg 
ggtggtggag 
ggcaaggttt 
acagaggtgg 
tgtgcctaca 
actgaagggt 
gtgcactctg 
aaatttccaa 
cttccgatgg 
tcacagtgac 
agaagctgtg 
agattcgtct 
gaatat tgca 
acatttttct 
ggcgcgtctg 
gatgattggc 
cttgctccat 
tgggcagcat 
attaaattaa 



catcccggga 
cccaggcaca 
acacctggga 
cagagtccct 
gtgacttcca 
cagcccctcc 
ggactgtgcc 
gtctcagaag 
gccagagaga 
tgggagccat 
gcttatcttg 
tgatgcagag 
gagtcggtcg 
gattttatcc 
cagatgaacc 
atggagcttg 
acgctcatga 
cctgctccgt 
actagaagct 
tttattgata 
attaaattaa 



tgtcagtggg 
ggtggcagcg 
cagcctgagg 
gccgcctggg 
cctgaaccac 
gcttgagaaa 
caagccaggg 
cctttcgggg 
actggagagc 
gagttactgt 
tcccctccag 
gggcgtgcct 
tcaccgcgag 
agagaaggtc 
ttttatttat 
tcataataat 
gtttttcatg 
ttccqttctc 
tcttttgggt 
gaagggggaa 
tttaaaaa 



ctgcctgttt 
tttcctggag 
agcccgggtt 
gggcctggac 
cagcagccca 
attttgtctg 
cctcagcctg 
gacttgaacc 
gtggttgctg 
ctcagggaca 
gtacaatagt 
gcggatgggc 
agccctcaca 
tatggcaagc 
aaagaataat 
agcagatatt 
atggtgatga 
tcctgggaga 
tcaattaaaa 
aaagtcattt 



atgctcagtc 
agaatgcttt 
tctgcagccc 
atgcagaggc 
gccccaccag 
tggatagcgt 
gcccacatgg 
ggttcccctg 
tcggcgaagc 
gaagcagatt 
gtctcaggag 
ccgctagaag 
gtgaagtgga 
aatgtatatt 
gtctttctgc 
acct gatcag 
gtagcactgc 
tatgctgctt 
agaaaataag 
ctacttgcat 



aggagagcct 
ggaacactct 
tttgtcatct 
aggacacctc 
cgtcctgcct 
ggcagtggac 
atcactattg 
tgggatggag 
catggctttg 
tttattcaga 
gaggacattg 
gagttcctct 
gtcagatcct 
tttctaatgt 
cctgctgtct 
gaatccctgt 
actgtcacct 
ttccaccaga 
ctagtcattc 
gattttttaa 



780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1948 



25 



<210> 4 

<211> 1486 

<212> DNA 

<213> Homo sapiens 



30 



35 



40 



45 



50 



55 



60 



65 



<400 
ccgcggcggc 
cctcggctgg 
gcgcgccccg 
tattgccgct 
caggcaggag 
gcacggagag 
attgtcattc 
cagatctgca 
gtgtaagctc 
aaacatcggc 
tcttcctaaa 
atcgtagggc 
cagatggtga 
cagatctctt 
gtcagcctga 
tgtacatctt 
cgctggtgtg 
gcttgaaagt 
tgtttttaga 
ttcgtcaagg 
acaggtacat 
ctcagtccat 
gggtcactat 
actaagaaaa 
tccctgtacc 



> 4 
gtccggggtc 
caacgggcgt 
atctggcccc 
gcctggtgcg 
tagacacccg 
cccctcgagc 
tgatactgtc 
ggcacaactc 
tagcaattta 
agataccaaa 
agcagtagaa 
tatgcaactt 
tggcgttgga 
gtcctacttc 
gcttgagagc 
ccgatgggtg 
cagaggattc 
ttggggcaga 
acggcctcgt 
ggaacagtct 
aagattcttt 
tgttccacgt 
cgcttggcac 
aaaggaagaa 
aaagaagccg 



tccagtaggg 
ccctccactc 
ctgccccgcg 
cttctccgac 
gacacccagc 
gcagcaggcc 
agagcagatg 
cagatgttcc 
gaaaatcgac 
ggaaaacaag 
gaagaacaaa 
gtacctgata 
aacagctaca 
cagcagcaac 
agtcagattc 
gtgtctagtg 
tacatctgtg 
agctgtatta 
gttcggtttg 
cttgatggtt 
cctgatggcc 
ttaagaacta 
aagacacaga 
aaaccacttg 
aatcagaagt 



<210> 5 

<211> 774 

<212> DNA 

<213> Homo sapiens 



<400> 
tgcgccaccc 
ggaccctgtc 
gtcccaccca 
act cctggtc 
gcgcccgtgg 
tggtagagac 
gcactctgcc 



caccccaccc 
cggcccagtg 
cccgccgcct 
tgtgaagagc 
ggtctgcgct 
ccccaggcct 
cccatttccc 



cacccccgcc 
ccactgcccc 
cccacggcac 
cggtggtgcc 
tggtctttct 
gtccaccctg 
cgatgtcccc 



ctgacgctcc 
cccgagtccc 
aagatggctg 
cgagaactct 
acccctcctc 
gccccgccag 
atgatgaaga 
gagctcagtg 
cttgcagagc 
aacaggcaaa 
atggagctct 
tagagttcaa 
ttgaagataa 
tcacatttca 
acatatcagt 
acttggacct 
ccagagaccc 
aacttgttcc 
atggcgtgta 
tctatagagc 
atgtgatgat 
gggaatacca 
ccatcagacc 
gactataaat 
tttcatgnng 



ggtgctcgca 
cggcagccgc 
ccgtacgccg 
gctaagctcc 
cggggggcgg 
catggcagaa 
aaatgaaagt 
gatgtttgaa 
agcaagaggc 
agaagaaaag 
ctatgaagcc 
gattacttat 
tgatgatgac 
ggagtctgtg 
gctgccaatg 
cagatcattg 
tgaaatatgg 
gtacacgtcc 
tatcagtaaa 
ctggcaccaa 
gttgacaacc 
ggactgatgc 
caaagtattt 
acccgatatt 
gggctn 



caatcccccg 
cgccacccca 
ggcccgcagt 
gctgcagaga 
tgcagagggg 
gctgaggaag 
cctgctgaaa 
cttgctccag 
tctctccaga 
gctcgagaac 
atcaagtt tt 
acccggtctc 
agcaaaatgg 
cttaaactgt 
gaggt cctga 
gagcagttgt 
cgtctggcct 
tggagagaga 
accacatat a 
gtggaatatt 
cctgaagagc 
caattctact 
tgctgtaata 
ttcgt cgggg 



atgcaacggg 
ccgaggctgc 
ctcggggaca 
cgtgcccgca 
gtgcttggat 
ccaagactcc 
tgcgggcgcg 



attgaagggt 
tagacgtagg 
ccagagctgc 
ggaactgggc 
ttgcatattt 
tcaggcagcg 
ggcagccacc 



cctgccggtg 
tgttaggcat 
cgacttggag 
tgggcctcgt 
attgcattgc 
tgtgggtccc 
caagcctgct 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1486 



60 
120 
180 
240 
300 
360 
420 
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ggctgcggcc 
agtccccgag 
cccgcacagc 
ttggtgtttg 
atttactgta 
aaaaatgctg 



ccctctcggc 
gagctgggcc 
cacccgtggg 
tctgtgtttg 
actgtcagtg 
ctctcagcct 



caggcattgg 
cctgcacagg 
gtgctgccct 
tttttcaaga 
tacacgtctg 
cccacaatta 



ctcagcccgc 
cacacagggc 
tatgcccggc 
aatgattcaa 
gaccccgttt 
aaccgcatgt 



tgagtggggg 
ccggccacac 
gccgggcacc 
attgctgctt 
catttttaca 
gatctccaaa 



gtcgtgggcc 
ccagcggccc 
aactccatgt 
ggattttgaa 
ccaatttggt 
aaaa 



480 
540 
600 
660 
720 
774 
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<210> 6 

<211> 2936 

<212> DNA 

<213> Homo sapiens 



<400 
cgtaaccctt 
tgtggaggtg 
gcgggt tgca 
ttatgaccta 
aggagtggga 
tacatggttg 
ggagacagtg 
ttacgatgtg 
cagaatgaac 
cagcaggaaa 
aaagatttag 
cccaaataca 
aggtagaaaa 
cagaatttga 
atgaatcaga 
atgttgatct 
tgtgtcaaaa 
attctgaggc 
aggctacagt 
tggaaaaaga 
aagctgaacg 
aactggaaat 
aaaggaataa 
cagctgttga 
tacaagctgc 
aactcgagaa 
aatctgaaaa 
ccaggaagga 
atgcagagaa 
tccaagtgac 
aggaaatggt 
cgcaactaga 
ataagcttca 
tgcagcgagc 
gactacaaga 
ggtcttaaat 
gatatacaga 
acagcatcta 
ccatttcctc 
agactagaag 
gaatgatgtt 
ttgacatttt 
gcctcaagaa 
tacaagtgaa 
gagaatgtaa 
attttttttt 
agatgcttgg 
agttgaaaaa 
aaaggctacc 



> 6 
agtcccaatg 
tccccggtgg 
aggagcgtgc 
ctttagagga 
aatacccttg 
tcagcacatt 
gattgacagg 
agcatcataa 
atgtaaagtt 
aacttcagct 
aagaaatgaa 
acaagaatta 
gtatagagct 
acaccagaag 
gattgcaaga 
cacaaaagac 
attaaaaggt 
tcaggtggaa 
cagatccctg 
gctacaatca 
agaaataaat 
aacagacatc 
gcttcaaagt 
acatcacaaa 
caaagaagaa 
cagattagca 
ggatcagtat 
gcttcagagt 
ggaaaaaaat 
ttcacttgca 
ggagagatta 
agctgaaaag 
tgaccgtatc 
tgcaattgcc 
gaaagtagaa 
agacaaaatg 
gaagacataa 
tcaatcctgt 
ctcatatgca 
aactggaaac 
cttggagaac 
aacctgtggc 
aaggtaccta 
gcattaaaag 
aacaaaaagg 
ttggcatttt 
ccataacagg 
agatgacaat 
ttttctaaaa 



cctccgtaag 
cgagcgcggc 
gtgcgtctgg 
aggtataatt 
gaaagaaaac 
taccgatatg 
ttctcagtcg 
agctaattat 
acaaaatgaa 
cctgcttgaa 
actgcagata 
gaaactccaa 
gtatataata 
gaagagtatg 
ctggaggaag 
agcaaacgag 
ttagaggctg 
aatgcccaaa 
ggggctgaaa 
agcagtgaac 
acattgtcca 
aaactggaga 
gaactggatg 
gtgctcttag 
ggttatcaaa 
gatttggaga 
gaagagaaat 
gttaggttaa 
gaaaattctg 
cagtcagaga 
aaacaagaat 
acattggaag 
acagacagag 
cagaaaaaga 
gtcttggagg 
ttcctttgaa 
tgaat ttcga 
tagctttcag 
ggaggaacaa 
aacacaaaga 
aggcagatca 
atttagatac 
ctacatgctg 
agaaattctc 
aactagttag 
gagtgaaata 
gtccatacac 
ttaaaagtca 
tctgtgtgca 



cggagttgag 
cagaactgcg 
tatagggagg 
tggct tcctg 
taaaacagta 
gacacttttc 
gagttccaga 
cagacactga 
ctcaagcacc 
gaactaagag 
ttaactccca 
tgagagaacg 
agcttcgcta 
cacgtatttt 
ataaagaaga 
tggaacaact 
aagtagcgga 
gaatacaggt 
aacaatcagc 
aaaatacctt 
gtaaagtaaa 
cagcaagagc 
gattacagtc 
tagaaaagga 
aacttgtggt 
aaatgaaagt 
tgcgggcttc 
aacttcagca 
acctaaaaca 
atgacttgct 
gccgaaattt 
agaaacagat 
aagaaaagta 
gaaaatctct 
caaagaaaga 
gactatacaa 
agtctaattt 
tcatcagcca 
catcaaaggg 
aaacaactag 
aaagaggtga 
tttttactgt 
tattgtatga 
agagatattt 
agtcaagttt 
taactatcat 
atcttctggt 
gcctaaagaa 
cataat taaa 



tgggtgcctg 
gtcacttaag 
acacctctgg 
agattctgcc 
agaaagccaa 
ccaataat tt 
aaatgttaat 
aggctgaaca 
tgtttaatga 
gagaattagt 
aaaattggaa 
ttttaggaat 
tgaacataca 
agatgaagga 
actacgtaac 
tgctcgagaa 
attaaaggct 
gcggcagttg 
taatt tacgg 
tttaattaat 
agaacttaaa 
taagagtgag 
agacaatgaa 
tcgtgaatta 
attacaagat 
ggaacatgat 
acagatggca 
acaaattgtg 
gcaaatcagt 
gaattcaaac 
tagaagccaa 
acagtggttg 
caatcaagct 
tcatgaaaac 
agaat tggaa 
ggcttcaaaa 
tggttcctaa 
tggttccaag 
aactctctct 
aggaacttgg 
agttggtgac 
ttgccaaaac 
ctgtcaggat 
agaatatttg 
taaaattttt 
taattctctc 
ttactatata 
tgtaaaggta 
gagcttaatt 



tggttggagc 
ttttccgtgt 
attgaggatc 
ttagcaagaa 
aacttatttt 
tcctcctggt 
tgatgaaagg 
cacaaggttg 
aaagcaaact 
agagaaaact 
ttgctaagag 
ctagatgaag 
tttctcaagt 
aaaataaaat 
cagctgctta 
aaagtctatt 
gaaaaggaga 
gctgagatgc 
gcagaacgct 
aaattgcata 
cattcaaaca 
ctagaaagag 
attctcaaag 
atacgtaaag 
gaaaagttag 
gtctggaggc 
gaagagatca 
actattgaaa 
agtttgcaga 
caaatgctga 
gctgaaaaag 
gaagaaaagc 
aaggagaaac 
aaattgaaaa 
acagaaatca 
aagactaaaa 
catgcctcca 
catggaact a 
acttcgcaaa 
atcttccgga 
tcagtaaaac 
acttgaatgt 
tttaagatta 
acaatggttt 
actttgttga 
ttcatctttg 
caaaaactgt 
tctatataca 
tttaaa 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2936 



<210> 7 

<211> 1387 

<212> DNA 

<213> Homo sapiens 
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15 



20 



25 



<400 
aaaggctaca 
tggaagagac 
agcaaacctg 
ggaattattt 
gtctaacatc 
attagcctat 
atctcaaagg 
ctaccttact 
acaacacctg 
tatagcaaat 
tctaagcaat 
acagccactg 
tcccattggt 
gagtggaact 
tcccaggttt 
tggtcttcat 
tgatgggcga 
tagatcaatt 
atgtgggcat 
ggacagttat 
gtatggacta 
tctctgtgga 
gaactttcct 
tgtgcca 



> 7 
tcattaacac 
tatgaaatgg 
gccttaccta 
ccccgctatc 
tgggtgaata 
acaggctgt t 
ctgcgcatta 
cttctggatg 
gtaattgaag 
agtagtaaaa 
ctgggaaaca 
acacagtatt 
atgaaggggc 
cagaagaatg 
aatgggtttc 
gaagatctta 
ccagactggg 
gttgtggatt 
ataagtgtcc 
atgcacttag 
agactgaata 
cttaattcag 
caggacaacc 



tagaaggagg 
t cccagaacc 
gaccagttat 
ttctcttcct 
tgggaaatgt 
ttagtcgaat 
aagaggaaga 
atgaggatca 
ttcgcaacaa 
tagatagaca 
catgcttcat 
ttatctcagg 
atatggctaa 
ttgccccatt 
agcaacagga 
atcgagtcca 
aagtagctgc 
tgttccatgg 
gatttgaccc 
aaataacagt 
tggatgaaaa 
aacaaatcct 
caaaaagtac 



acgattaaaa 
tgtgtggaga 
caagaacagc 
gagacagcag 
accttctccg 
gcagaccatc 
tatgcgcctg 
taaattggaa 
agatatgagt 
caaggttccc 
gaactcaagc 
gagacatctt 
atgctatggt 
aaagcttcgg 
ctcccaagaa 
tgaaaagcca 
agaggcctgg 
gcagctaaga 
tttcaatttt 
gattaagtta 
gtacacaggt 
tctagcagaa 
cgaacttctc 



cgaactccac 
gcactttatc 
aagacagaca 
cctgccactc 
aatgcacctt 
aaggaaattc 
tggctataca 
tatttgaaaa 
tggcctgagg 
acagaaaagg 
atccagtgtg 
tatgaactca 
gatttagtgc 
tggaccatag 
cttctggctt 
tatgtggaac 
gacaaccatc 
tctcaagtaa 
ttgtctttgc 
gatggtacta 
ttaaaaaaac 
gtacatggtt 
agtgaagtgg 



agctgat t ca 
actggtatgg 
tcccagagct 
ggacacagca 
taaagcgggt 
acgaatatct 
acagtgagaa 
tccaggatga 
agatgtcttt 
gagccacagg 
ttagtaacac 
acaggacaaa 
aggaactttg 
caaaatatgc 
ttctcttgga 
tgaaggacag 
taagaagaaa 
aatgcaagac 
cactaccaat 
cccctgtacg 
agctgagtga 
ccaacataaa 
gattttttgg 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1387 



30 



<210> 8 

<211> 749 

<212> DNA 

<213> Homo sapiens 



35 



40 



45 



<400 
gcccaacatg 
ctactgcgcc 
cggctcgatc 
tggctgtcag 
catctgtcac 
actatgtatt 
taat tgaagg 
ctggtaaaac 
gtattatttc 
caattaaaaa 
ataatgaaga 
aatcaaatat 
aacacgtact 



> 8 
gctggagcgc 
cgagtcccgc 
gctgcccgcc 
aataagacca 
accaggagag 
tgacattgac 
ttgctttgaa 
atacacaatg 
tcgagctgtt 
tgggcttcct 
ggtccttgac 
aagaattcat 
gtgaatacag 



ggcggaggtg 
ggccccagtg 
gccagcatgt 
cagcttgcca 
cctcaggtct 
tcccagcaag 
ggatacaatg 
ggaacaggat 
aaacaccttt 
gctccagatt 
ttatttgata 
gaagattcaa 
aatcnagag 



agccggccgc 
gcgcctcagc 
tgggcgcccc 
aagagaagat 
tcctagggaa 
agcagatcta 
ctacagtttt 
ttgatgttaa 
ttaagagtat 
ttaaagtgaa 
ccactcgtga 
ctggagggaa 



ccgcccgcag 
tctgcggtgc 
ggacgagagc 
tgaaggatgc 
aga taaggct 
cat tcaatgt 
tgcttatgga 
cattgttgag 
tgaagaaaaa 
tgcccaattc 
tattgatgca 
tttatactgt 



acgccccagc 
cgaggcccaa 
t ccgtgcggg 
catatttgta 
tttacttttg 
atagaaaaac 
caaactggag 
gaagaactgg 
aaacacatag 
ttagagctct 
aaaagtaaaa 
gggcgtttcc 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
749 



50 



<210> 9 

<211> 686 

<212> DNA 

<213> Homo sapiens 



55 



60 



65 



<400 
tggaaattat 
aagnaaaaaa 
gaaaagggtg 
gagtgatcat 
tgaaagttct 
ggcaaacatt 
ccagaaaaga 
taaaattcgg 
atcttactca 
catgaacaaa 
aaatcagtct 
aaaaacaaag 



> 9 
agacctagca 
gtgtggctgg 
tttcggaaag 
gaggatgaag 
gatgaatcag 
acttgtgaaa 
ctgcagactc 
gatactcagc 
gaagaaaaag 
gaactgcaga 
cagtatgaaa 
gttcgcctaa 



aaaaaagatt 
taaagaggat 
agaaacccaa 
aagaggagga 
attctgaatc 
ttgcaattaa 
tgaaaaagca 
ttgaaagaga 
caaaaaaagt 
gacttcaagc 
agcnattgaa 
tgaaaa 



tagagaaggt 
aatacagaca 
tgaattagaa 
ggaggaggag 
agatgaaaaa 
gcaaaagctg 
gtatgaagag 
ccaggtgctt 
taggtctgaa 
agctcaaaaa 
gaaattgcag 



tgnaaagaaa 
ctgaccaaga 
gtggaagaaa 
gaagatgaca 
gccnattatc 
attgatgaac 
aagntaatga 
caaaacttag 
tatgaaaaga 
gaacatgcaa 
caggatgtga 



agaaaagagg 
gaagaaagaa 
gtcaagaagt 
ttgatggggg 
aagcagact t 
tagaaaacag 
tgctgcaaca 
gctcggtaga 
aactccaagc 
ggttgcttaa 
tggaaatgaa 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
686 



<210> 10 
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10 



20 



25 



30 



60 
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6 

<211> 833 
<212> DNA 
<213> Homo sapiens 

<400> 10 

gttcttctgt cgccggcttc agcagcccgc gcccgggcag gaatagaaga tgaacaaacc 60 

cataacacca tcaacatatg tgcgctgcct caatgttgga ctaattagga agctgtcaga 120 

ttttattgat cctcaagaag gatggaagaa gttagctgta gctattaaaa aaccatctgg 180 

tgatgataga tacaatcagt ttcacataag gagatttgaa gcattacttc aaactggaaa 240 

aagtcccact tctgaattac tgtttgactg gggcaccaca aattgcacag ttggtgatct 300 

tgtggatctt ttgatccaaa atgaattttt tgctcctgcg agtcttttgc tcccagatgc 360 

tgttcccaaa actgctaata cactaccttc taaagaagct ataacagttc agcaaaaaca 420 

gatgcctttc tgtgacaaag acaggacatt gatgacacct gtgcagaatc ttgaacaaag 4 80 

ctatatgcca cctgactcct caagtccaga aaataaaagt ttagaagtta gtgatacacg 540 

ttttcacagt ttttcatttt atgaattgaa gaatgtcaca aataactttg atgaacgacc 600 

catttctgtt ggtggtaata aaatgggaga gggaggattt ggagttgtat ataaagggct 660 

acgtaaataa cacaactgtg gcagtgaaag aa-gcttgcag caatggttga cattactact 720 

gaaggaactg aaaccagcag tttgatccaa gaaaataaaa gtaatgggca aaagtggtca 780 

accatggaaa aactttagta ggaacctact tgggttttct caagtggatg gga 833 

<210> 11 

<211> 799 

<212> DNA 

<213> Homo sapiens 



<400> 11 

taaaaatatc cccttngatg atacctgcca ataatgatat gtcccattat tagattatgt 60 

tacatgccaa agtttaaagg aatttgggca gatccagtta aggttcctta aacaacntca 120 

ctttgagact gttgaaaggg cctgacctaa tccaagtgaa ccccttgcaa gaagaattct 180 

ccttgtaagc cttgaagaag tatgtgagag ggccacattg gctaaaacct aaaggtggcc 240 

tctaggagat gagacctacc ttccagttgt cagcaagcag gaaaaaaaaa ttgggacctc 300 

agttgcaacc acaaggaact gaattctgcc aaaaatntga gtcagcttag aagagtactc 360 

caagcttcag atgataacca cagcctgggc tgacacctgg atttagcttt gcatgatcct 420 

cagtatgaga atctatctgt tctgtgctgg acttctaata tatagaactg tgagataatg 4 80 

35 ggtcacattg gctggatgtg gtggctcata cctgtaaatc ccagcacttt gggaggccga 540 

ggcaggcaga tcacctgagg tcaagagttc aagaccggcc tggccaacat ggtgaaaccc 600 

cgtctntact aaaaatacaa aaattagacg agcgtggtgg tggacacctg tagtcccagc 660 

tgcttgggag gctgaggcag gagactagct ggaaccaggg aggtagaggt tgcagtgagc 720 

tgagatcgtg ccactgcact ccagcctggg tgacagagtg agactccatc ataaataaat 780 

40 aaataaataa atgggtcnc 799 

<210> 12 
<211> 812 
<212> PRT 
45 <213> Homo sapiens 

<400> 12 

Met Ala Gly Gly His Cys Gly Ser Phe Pro Ala Ala Ala Ala Gly Ser 
15 10 15 

50 Gly Glu He Val Gin Leu Asn Val Gly Gly Thr Arg Phe Ser Thr Ser 
20 25 30 

Arg Gin Thr Leu Met Trp He Pro Asp Ser Phe Phe Ser Ser Leu Leu 

35 40 45 

Ser Gly Arg He Ser Thr Leu Arg Asp Glu Thr Gly Ala He Phe He 
55 50 55 60 

Asp Arg Asp Pro Ala Ala Phe Ala Pro He Leu Asn Phe Leu Arg Thr 
65 70 75 80 

Lys Glu Leu Asp Leu Arg Gly Val Ser He Asn Val Leu Arg His Glu 

85 90 95 

Ala Glu Phe Tyr Gly He Thr Pro Leu Val Arg Arg Leu Leu Leu Cys 

100 105 110 

Glu Glu Leu Glu Arg Ser Ser Cys Gly Ser Val Leu Phe His Gly Tyr 

115 120 125 

Leu Pro Pro Pro Gly He Pro Ser Arg Lys lie Asn Asn Thr Val Arg 
65 130 135 140 

Ser Ala Asp Ser Arg Asn Gly Leu Asn Ser Thr Glu Gly Glu Ala Arg 
!45 150 155 160 
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■ Gly Asn Gly Thr Gin Pro Val Leu Ser Gly Thr Gly Glu Glu Thr Val 
165 170 175 

Arg Leu Gly Phe Pro Val Asp Pro Arg Lys Val Leu He Val Ala Gly 
180 185 190 

5 His His Asn Trp He Val Ala Ala Tyr Ala His Phe Ala Tyr Arq He 
195 200 205 

Lys Glu Ser Ser Gly Trp Gin Gin Val Phe Thr Ser Pro Tyr Leu Asp 

210 215 220 

Trp Thr He Glu Arg Val Ala Leu Asn Ala Lys Val Val Gly Glv Pro 
10 225 230 235 240 

His Gly Asp Lys Asp Lys Met Val Ala Val Ala Ser Glu Ser Ser He 

245 250 255 

He Leu Trp Ser Val Gin Asp Gly Gly Ser Gly Ser Glu He Gly Val 
260 265 270 

15 Phe Ser Leu Gly Val Pro Val Asp Ala Leu Phe Phe lie Gly Asn Gin 
275 280 285 

Leu Val Ala Thr Ser His Thr Gly Lys Val Gly Val Trp Asn Ala Val 

290 295 300 

Thr Gin His Trp Gin Val Gin Asp Val Val Pro He Thr Ser Tvr Aso 
20 305 310 315 320 

Thr Ala Gly Ser Phe Leu Leu Leu Gly Cys Asn Asn Gly Ser lie Tyr 

325 330 335 

Tyr lie Asp Met Gin Lys Phe Pro Leu Arg Met Lys Asp Asn Asp Leu 
340 345 350 

25 Leu Val Thr Glu Leu Tyr His Asp Pro Ser Asn Asp Ala lie Thr Ala 
355 360 365 

Leu Ser Val Tyr Leu Thr Pro Lys Thr Ser Val Ser Gly Asn Trp lie 

370 375 380 

Glu lie Ala Tyr Gly Thr Ser Ser Gly Ala Val Arg Val lie Val Gin 
30 385 390 395 400 

His Pro Glu Thr Val Gly Ser Gly Pro Gin Leu Phe Gin Thr Phe Thr 

405 410 415 

Val His Arg Ser Pro Val Thr Lys lie Met Leu Ser Glu Lys His Leu 
420 425 430 

35 Val Ser Val Cys Ala Asp Asn Asn His Val Arg Thr Trp Thr Val Thr 
435 440 445 

Arg Phe Gly Met lie Ser Thr Gin Pro Gly Ser Thr Pro Leu Ala Ser 

450 455 460 

Phe Lys lie Leu Ser Leu Glu Glu Thr Glu Ser His Gly Ser Tyr Ser 
40 . 465 470 475 480 

Ser Gly Asn Asp lie Gly Pro Phe Gly Glu Arg Asp Asp Gin Gin Val 

485 490 495 

Phe lie Gin Lys Val Val Pro lie Thr Asn Lys Leu Phe Val Arg Leu 
500 505 510 

45 Ser Ser Thr Gly Lys Arg lie Cys Glu lie Gin Ala Val Asp Cys Thr 
515 520 525 

Thr lie Ser Ser Phe Thr Val Arg Glu Cys Glu Gly Ser Ser Arg Met 

530 535 540 

Gly Ser Arg Pro Arg Arg Tyr Leu Phe Thr Gly His Thr Asn Gly Ser 
50 545 550 555 560 

He Gin Met Trp Asp Leu Thr Thr Ala Met Asp Met Val Asn Lys Ser 

565 570 575 

Glu Asp Lys Asp Val Gly Gly Pro Thr Glu Glu Glu Leu Leu Lys Leu 
580 585 590 

55 Leu Asp Gin Cys Asp Leu Ser Thr Ser Arg Cys Ala Thr Pro Asn lie 
595 600 605 

Ser Pro Ala Thr Ser Val Val Gin His Ser His Leu Arg Glu Ser Asn 

610 615 620 

Ser Ser Leu Gin Leu Gin His His Asp Thr Thr His Glu Ala Ala Thr 
60 625 630 635 640 

Tyr Gly Ser Met Arg Pro Tyr Arg Glu Ser Pro Leu Leu Ala Arg Ala 

645 650 655 

Arg Arg Thr Glu Ser Phe His Ser Tyr Arg Asp Phe Gin Thr lie Asn 

660 665 670 

Leu Asn Arg Asn Val Glu Arg Ala Val Pro Glu Asn Gly Asn Leu Gly 

675 680 685 

Pro lie Gin Ala Glu Val Lys Gly Ala Thr Gly Glu Cys Asn He Ser 



65 
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690 

Glu Arg Lys Ser 
705 

Ser Gly Leu Glu 

Lys Arg Ser Ser 
740 

Lys Gly Gly Phe 
755 

Tyr Leu Ala Ser 
770 

Gly Thr Ala Ser 
785 

Lys Lys Ser Asp 



695 

Pro Gly Val Glu 
710 

Val His Lys lie 
725 

Glu Asp Glu Asn 

Glu Gly Gly Gly 
760 

Ser Pro Ser Thr 
775 

Pro Ser Pro Thr 
790 

Ser Ser Gly Gin 
805 



700 

lie Lys Ser Leu 
715 

Ala Glu Gly Phe 
730 

Glu Asn Lys lie 
745 

Phe Leu Gly Arg 

Ser Asp Gly Gly 
780 

Lys Thr Thr Pro 
795 

Glu Tyr Ser Leu 
810 



Arg Glu Leu Asp 
720 

Ser Glu Ser Lys 
735 

Glu Phe Arg Lys 
750 

Lys Lys Val Pro 
765 

Thr Asp Ser Pro 

Ser Pro Arg His 
800 



<210> 13 
<211> 167 
<212> PRT 

<213> Homo sapiens 



<400> 13 
Leu Ala Ser Ala 
1 

Pro Pro Gly Ser 
20 

Leu Leu Gly Phe 
35 

Val Asn Phe Phe 
50 

His Arg Lys Arg 
65 

Lys lie Pro Ala 

Glu Glu Leu Glu 
100 

Gin Glu Glu Cys 
115 

Leu Ala Ala Thr 
130 

Glu Val Pro Glu 
145 

Cys Val Ser Asn 



Arg 


He 


Pro 


Gly 


5 








Leu 


Phe 


Phe 


Ala 


Cys 


Tyr 


Met 


Ala 








40 


Arg 


Pro 


Phe 


Thr 






55 




Asn 


Asn 


Leu 


Thr 




70 






Val 


Thr 


Tyr 


Pro 


85 








Leu 


Asp 


Lys 


Trp 


Val 


Ser 


Thr 


He 








120 


Arg 


Glu 


Phe 


He 






135 




His 


He 


Thr 


Glu 




150 






Thr 


Ala 


Lys 




165 









Arg Pro Tyr Val 
10 

Pro Ala Gly Thr 
25 

Ala Phe Leu Lys 

Arg Phe Leu Val 
60 

He Leu Gin Arg 
75 

Lys Asn Glu Ser 
90 

Lys Thr Thr Met 
105 

Ser Ser Ser Lys 

Glu Met Trp Arg 
140 

Glu Glu Leu Lys 
155 



Gly Val Leu Arg 
15 

Thr He Ser Ala 
30 

Met Ser Val Ser 
45 

Pro Phe Thr Leu 

Tyr Met Ser Ser 
80 

Thr Arg Pro Ser 
95 

Lys Ser Ser Val 
110 

Asp Glu Asp Pro 
125 

Leu Leu Gly Arg 

Thr Leu Met Glu 
160 



<210> 14 
<211> 452 
<212> PRT 

<213> Homo sapiens 



<400> 14 



Pro 


Cys 


Glu 


Gly 


Gly 


Ala 


Arg 


Ser 


Cys 


Leu 


Val 


Thr 


Glu 


Ser 


Ala 


Arg 


1 








5 










10 










15 


Gly 


Gly 


Leu 


Gin 


Phe 


Leu 


Gin 


Gin 


Cys 


Asp 


Arg 


Glu 


Asp 


Leu 


Val 


Glu 








20 










25 








30 






Leu 


Ala 


Leu 
35 


Pro 


Gin 


Leu 


Ala 


Gin 
40 


Val 


Val 


Thr 


Val 


Tyr 
45 


Glu 


Phe 


Leu 


Leu 


Met 


Lys 


Val 


Glu 


Lys 


Asp 


His 


Leu 


Ala 


Lys 


Pro 


Phe 


Phe 


Pro 


Ala 




50 










55 








60 










He 


Tyr 


Lys 


Glu 


Phe 


Glu 


Glu 


Leu 


His 


Lys 


Met 


Val 


Lys 


Lys 


Met 


Cys 


65 










70 










75 






80 


Gin 


Asp 


Tyr 


Leu 


Ser 
85 


Ser 


Ser 


Gly 


Leu 


Cys 
90 


Ser 


Gin 


Glu 


Thr 


Leu 
95 


Glu 


He 


Asn 


Asn 


Asp 


Lys 


Val 


Ala 


Glu 


Ser 


Leu 


Gly 


He 


Thr 


Glu 


Phe 


Leu 








100 










105 








110 






Arg 


Lys 


Lys 
115 


Glu 


He 


His 


Pro 


Asp 
120 


Asn 


Leu 


Gly 


Pro 


Lys 
125 


His 


Leu 


Ser 
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Arg Asp Met Asp Gly Glu Gin Leu Glu Gly Ala Ser Ser Glu Lys Arg 

130 135 140 

Glu Arg Glu Ala Ala Glu Glu Gly Leu Ala Ser Val Lys Arg Pro Arq 
145 150 155 160 

5 Arg Glu Ala Leu Ser Asn Asp Thr Thr Glu Ser Leu Ala Ala Asn Ser 

165 170 175 

Arg Gly Arg Glu Lys Pro Arg Pro Leu His Ala Leu Pro Ala Gly Phe 

180 185 190 

Ser Pro Pro Val Asn Val Thr Val Ser Pro Arg Ser Glu Glu Ser His 
10 195 200 205 

Thr Thr Thr Val Ser Gly Gly Asn Gly Ser Val Phe Gin Ala Gly Pro 

210 215 220 

Gin Leu Gin Ala Leu Ala Asn Leu Glu Ala Arg Arg Gly Ser He Glv 
225 230 235 240 

15 Ala Ala Leu Ser Ser Arg Asp Val Ser Gly Leu Pro Val Tyr Ala Gin 

245 250 255 

Ser Gly Glu Pro Arg Arg Leu Thr Gin Ala Gin Val Ala Ala Phe Pro 

260 265 270 

Gly Glu Asn Ala Leu Glu His Ser Ser Asp Gin Asp Thr Trp Asp Ser 
20 275 280 285 

Leu Arg Ser Pro Gly Phe Cys Ser Pro Leu Ser Ser Gly Gly Glv Ala 

290 295 300 

Glu Ser Leu Pro Pro Gly Gly Pro Gly His Ala Glu Ala Gly His Leu 
305 310 315 320 

25 Gly Lys Val Cys Asp Phe His Leu Asn His Gin Gin Pro Ser Pro Thr 

'325 330 335 

Ser Val Leu Pro Thr Glu Val Ala Ala Pro Pro Leu Glu Lys He Leu 

340 345 350 

Ser Val Asp Ser Val Ala Val Asp Cys Ala Tyr Arg Thr Val Pro Lys 
30 355 360 365 

Pro Gly Pro Gin Pro Gly Pro His Gly Ser Leu Leu Thr Glu Gly Cys 

370 375 380 

Leu Arg Ser Leu Ser Gly Asp Leu Asn Arg Phe Pro Cys Gly Met Glu 
385 390 395 400 

35 Val His Ser Gly Gin Arg Glu Leu Glu Ser Val Val Ala Val Gly Glu 

405 410 415 

Ala Met Ala Leu Lys Phe Pro Met Gly Ala Met Ser Tyr Cys Leu Arq 

420 425 430 

Asp Arg Ser Arg Phe Leu Phe Arg Leu Pro Met Gly Leu Ser Cys Pro 
40 435 440 445 

Leu Gin Val Gin 
450 

<210> 15 
45 <211> 321 

<212> PRT 

<213> Homo sapiens 



50 



<400> 15 

Met Ala Glu Ala Glu Glu Asp Cys His Ser Asp Thr Val Arg Ala Asp 

15 10 15 

Asp Asp Glu Glu Asn Glu Ser Pro Ala Glu Thr Asp Leu Gin Ala Gin 

20 25 30 

Leu Gin Met Phe Arg Ala Gin Trp Met Phe Glu Leu Ala Pro Glv Val 
55 35 40 45 

Ser Ser Ser Asn Leu Glu Asn Arg Pro Cys Arg Ala Ala Arg Gly Ser 

50 55 60 • 

Leu Gin Lys Thr Ser Ala Asp Thr Lys Gly Lys Gin Glu Gin Ala Lys 
65 70 75 80 

60 Glu Glu Lys Ala Arg Glu Leu Phe Leu Lys Ala Val Glu Glu Glu Gin 

85 90 95 

Asn Gly Ala Leu Tyr Glu Ala He Lys Phe Tyr Arg Arg Ala Met Gin 

100 105 HO 

Leu Val Pro Asp He Glu Phe Lys lie Thr Tyr Thr Arg Ser Pro Asp 
65 115 120 125 

Gly Asp Gly Val Gly Asn Ser Tyr He Glu Asp Asn Asp Asp Asp Ser 
130 135 140 



BNSDOCID: <WO 0020587A2J_> 



WO 00/20587 PCT/US99/22873 

Lys Met Ala Asp Leu Leu Ser Tyr Phe Gin Gin Gin Leu Thr Phe Gin 

150 155 160 

Glu Ser Val Leu Lys Leu Cys Gin Pro Glu Leu Glu Ser Ser Gin He 

165 170 175 

His He Ser Val Leu Pro Met Glu Val Leu Met Tyr He Phe Arq TrD 

180 185 190 

Val Val Ser Ser Asp Leu Asp Leu Arg Ser Leu Glu Gin Leu Ser Leu 

195 200 205 

Val Cys Arg Gly Phe Tyr lie Cys Ala Arg Asp Pro Glu He Trp Arq 

210 215 220 

Leu Ala Cys Leu Lys Val Trp Gly Arg Ser Cys He Lys Leu Val Pro 
225 230 2 35 240 

Tyr Thr Ser Trp Arg Glu Met Phe Leu Glu Arg Pro Arg Val Ara Phe 

245 250 255 

Asp Gly Val Tyr He Ser Lys Thr Thr Tyr He Arg Gin Gly Glu Gin 

2 60 265 270 

Ser Leu Asp Gly Phe Tyr Arg Ala Trp His Gin Val Glu Tyr Tvr Ara 

275 280 285 

Tyr He Arg Phe Phe Pro Asp Gly His Val Met Met Leu Thr Thr Pro 
20 290 295 300 

Glu Glu Pro Gin Ser He Val Pro Arg Leu Arg Thr Arg Glu Tyr Gin 
305 310 315 320 



10 



15 



25 



30 



40 



50 



Asp 



<210> 16 
<211> 172 
<212> PRT 

<213> Homo sapiens 



<400> 16 

Ala Cys Pro Thr His Pro Pro Pro Pro Thr Ala Pro Arg Gly His Gin 

15 10 15 

Ser Cys Arg Leu Gly Asp Ser Trp Ser Val Lys Ser Arg Trp Cys Pro 
35 20 25 30 

Cys Pro Gin Glu Leu Gly Trp Ala Ser Cys Ala Arg Gly Val Cys Ala 

35 40 45 

Trp Ser Phe Cys Ala Trp He Cys He Phe He Ala Leu Leu Val Glu 

50 55 60 

Thr Pro Arg Pro Val His Pro Ala Lys Thr Pro Gin Ala Ala Cys Gly 
65 70 75 80 

Ser Arg Thr Leu Pro Pro Phe Pro Arg Cys Pro Leu Arg Ala Arg Ala 

85 90 95 

Ala Thr Gin Ala Cys Trp Leu Arg Pro Pro Leu Gly Gin Ala Leu Ala 
45 100 105 110 

Gin Pro Ala Glu Trp Gly Val Val Gly Gin Ser Pro Arg Ser Trp Ala 

115 120 125 

Pro Ala Gin Ala His Arg Ala Arg Pro His Pro Ala Ala Pro Arq Thr 

130 135 140 

Ala Thr Arg Gly Val Leu Pro Leu Cys Pro Ala Pro Gly Thr Asn Ser 
145 150 155 160 

Met Phe Gly Val Cys Leu Cys Leu Phe Phe Lys Lys 
165 170 

55 <210> 17 

<211> 472 
<212> PRT 

<213> Homo sapiens 

60 <400> 17 

Met Arg Glu Arg Phe Arg Asn Leu Asp Glu Glu Val Glu Lys Tyr Arg 

1 5 10 15 

Ala Val Tyr Asn Lys Leu Arg Tyr Glu His Thr Phe Leu Lys Ser Glu 
20 25 30 

65 Phe Glu His Gin Lys Glu Glu Tyr Ala Arg He Leu Asp Glu Gly Lys 
35 40 45 

He Lys Tyr Glu Ser Glu He Ala Arg Leu Glu Glu Asp Lys Glu Glu 
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50 

Leu Arg Asn 
65 

Val Glu Gin 

5 

Gly Leu Glu 

Glu Ala Gin 
115 

10 Glu Met Gin 
130 

Asn Leu Arg 
145 

Gin Asn Thr 

15 

Asn Thr Leu 

Glu lie Thr 
195 

20 Glu Arg Glu 
210 
Asp Asn Glu 

225 

Val Glu Lys 

25 

Glu Gly Tyr 

Glu Asn Arg 
275 

30 Trp Arg Gin 
290 
Gin Met Ala 
305 

Lys Leu Gin 

35 

Asn Glu Asn 

Val Thr Ser 
355 

40 Met Leu Lys 
370 
Arg Ser Gin 
385 

Glu Lys Gin 

45 

lie Thr Asp 

Arg Ala Ala 
435 

50 Leu Lys Arg 
450 
Glu Leu Glu 
465 

55 <210> 
<211> 
<212> 
<213> 

60 <400> 
Lys Ala Thr 
1 

Gin Leu lie 

65 Arg Ala Leu 
35 

Val lie Lys 



55 

Gin Leu Leu Asn 
70 

Leu Ala Arg Glu 
85 

Ala Glu Val Ala 
100 

Val Glu Asn Ala 

Ala Thr Val Arg 
135 

Ala Glu Arg Leu 
150 

Phe Leu lie Asn 
165 

Ser Ser Lys Val 
180 

Asp lie Lys Leu 

Arg Asn Lys Leu 
215 

lie Leu Lys Ala 
230 

Asp Arg Glu Leu 
245 

Gin Lys Leu Val 
260 

Leu Ala Asp Leu 

Ser Glu Lys Asp 
295 

Glu Glu lie Thr 
310 

Gin Gin lie Val 
325 

Ser Asp Leu Lys 
340 

Leu Ala Gin Ser 

Glu Met Val Glu 
375 

Ala Glu Lys Ala 
390 

lie Gin Trp Leu 
405 

Arg Glu Glu Lys 
420 

lie Ala Gin Lys 

Leu Gin Glu Lys 
455 

Thr Glu lie Arg 
470 

18 
4 62 
PRT 

Homo sapiens 
18 

Ser Leu Thr Leu 

5 

His Gly Arg Asp 
20 

Tyr His Trp Tyr 
Asn Ser Lys Thr 



60 

Val Asp Leu Thr Lys 
75 

Lys Val Tyr Leu Cys 
90 

Glu Leu Lys Ala Glu 
105 

Gin Arg lie Gin Val 
120 

Ser Leu Gly Ala Glu 
140 

Glu Lys Glu Leu Gin 
155 

Lys Leu His Lys Ala 
170 

Lys Glu Leu Lys His 
185 

Glu Thr Ala Arg Ala 
200 

Gin Ser Glu Leu Asp 
220 

Ala Val Glu His His 
235 

lie Arg Lys Val Gin 
250 

Val Leu Gin Asp Glu 
265 

Glu Lys Met Lys Val 
280 

Gin Tyr Glu Glu Lys 
300 

Arg Lys Glu Leu Gin 
315 

Thr lie Glu Asn Ala 
330 

Gin Gin lie Ser Ser 
345 

Glu Asn Asp Leu Leu 
360 

Arg Leu Lys Gin Glu 
380 

Gin Leu Glu Ala Glu 
395 

Glu Glu Lys His Lys 
410 

Tyr Asn Gin Ala Lys 
425 

Lys Arg Lys Ser Leu 
440 

Val Glu Val Leu Glu 
460 

Ser 



Glu Gly Gly Arg Leu 
10 

Tyr Glu Met Val Pro 
25 

Gly Ala Asn Leu Ala 
40 

Asp lie Pro Glu Leu 



Asp Ser Lys Arg 
80 

Gin Lys Leu Lys 
95 

Lys Glu Asn Ser 
110 

Arg Gin Leu Ala 
125 

Lys Gin Ser Ala 

Ser Ser Ser Glu 
160 

Glu Arg Glu lie 
175 

Ser Asn Lys Leu 
190 

Lys Ser Glu Leu 
205 

Gly Leu Gin Ser 

Lys Val Leu Leu 
240 

Ala Ala Lys Glu 
255 

Lys Leu Glu Leu 
270 

Glu His Asp Val 
285 

Leu Arg Ala Ser 

Ser Val Arg Leu 
320 

Glu Lys Glu Lys 
335 

Leu Gin lie Gin 
350 

Asn Ser Asn Gin 
365 

Cys Arg Asn Phe 

Lys Thr Leu Glu 
400 

Leu His Asp Arg 
415 

Glu Lys Leu Gin 
430 

His Glu Asn Lys 
445 

Ala Lys Lys Glu 



Lys Arg Thr Pro 
15 

Glu Pro Val Trp 
30 

Leu Pro Arg Pro 
45 

Glu Leu Phe Pro 
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50 55 60 

Arg Tyr Leu Leu Phe Leu Arg Gin Gin Pro Ala Thr Arg Thr Gin Gin 
65 70 75 80 

Ser Asn He Trp Val Asn Met Gly Asn Val Pro Ser Pro Asn Ala Pro 

85 90 95 

Leu Lys Arg Val Leu Ala Tyr Thr Gly Cys Phe Ser Arq Met Gin Thr 

100 105 HO 

He Lys Glu He His Glu Tyr Leu Ser Gin Arg Leu Arg He Lys Glu 

115 120 125 

Glu Asp Met Arg Leu Trp Leu Tyr Asn Ser Glu Asn Tyr Leu Thr Leu 

130 135 140 

Leu Asp Asp Glu Asp His Lys Leu Glu Tyr Leu Lys He Gin Asp Glu 
145 150 155 160 

Gin His Leu Val He Glu Val Arg Asn Lys Asp Met Ser Trp Pro Glu 
15 165 170 175 

Glu Met Ser Phe He Ala Asn Ser Ser Lys He Asp Arg His Lys Val 

180 185 190 

Pro Thr Glu Lys Gly Ala Thr Gly Leu Ser Asn Leu Gly Asn Thr Cys 
195 200 205 

20 Phe Met Asn Ser Ser He Gin Cys Val Ser Asn Thr Gin Pro Leu Thr 
210 215 220 

Gin Tyr Phe lie Ser Gly Arg His Leu Tyr Glu Leu Asn Arg Thr Asn 
225 230 235 240 

Pro He Gly Met Lys Gly His Met Ala Lys Cys Tyr Gly Asp Leu Val 
25 245 250 255 

Gin Glu Leu Trp Ser Gly Thr Gin Lys Asn Val Ala Pro Leu Lys Leu 

260 265 270 

Arg Trp Thr He Ala Lys Tyr Ala Pro Arg Phe Asn Gly Phe Gin Gin 

275 280 285 

Gin Asp Ser Gin Glu Leu Leu Ala Phe Leu Leu Asp Gly Leu His Glu 

290 295 300 

Asp Leu Asn Arg Val His Glu Lys Pro Tyr Val Glu Leu Lys Asp Ser 
305 310 315 320 

Asp Gly Arg Pro Asp Trp Glu Val Ala Ala Glu Ala Trp Asp Asn His 
35 325 330 335 

Leu Arg Arg Asn Arg Ser He Val Val Asp Leu Phe His Gly Gin Leu 

340 345 350 

Arg Ser Gin Val Lys Cys Lys Thr Cys Gly His He Ser Val Arg Phe 

355 360 365 

Asp Pro Phe Asn Phe Leu Ser Leu Pro Leu Pro Met Asp Ser Tyr Met 

370 375 380 

His Leu Glu He Thr Val He Lys Leu Asp Gly Thr Thr Pro Val Arg 
385 390 395 400 

Tyr Gly Leu Arg Leu Asn Met Asp Glu Lys Tyr Thr Gly Leu Lys Lys 
45 405 410 415 

Gin Leu Ser Asp Leu Cys Gly Leu Asn Ser Glu Gin He Leu Leu Ala 

420 425 430 

Glu Val His Gly Ser Asn He Lys Asn Phe Pro Gin Asp Asn Pro Lys 
435 440 445 

50 Ser Thr Glu Leu Leu Ser Glu Val Gly Phe Phe Gly Val Pro 
450 455 460 

<210> 19 
<211> 243 
55 <212> PRT 

<213> Homo sapiens 

<400> 19 

Pro Thr Trp Leu Glu Arg Gly Gly Gly Glu Pro Ala Ala Arq Pro Gin 
60 1 5 10 15 

Thr Pro Gin Pro Thr Ala Pro Glu Ser Arg Gly Pro Ser Gly Ala Ser 

20 25 30 

Ala Leu Arg Cys Arg Gly Pro Thr Ala Arg Ser Leu Pro Ala Ala Ser 
35 40 45 

65 Met Leu Gly Ala Pro Asp Glu Ser Ser Val Arg Val Ala Val Arg He 
50 55 60 

Arg Pro Gin Leu Ala Lys Glu Lys He Glu Gly Cys His He Cys Thr 



40 
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65 70 75 80 

Ser Val Thr Pro Gly Glu Pro Gin Val Phe Leu Gly Lys Asp Lys Ala 

85 90 95 

Phe Thr Phe Asp Tyr Val Phe Asp He Asp Ser Gin Gin Glu Gin He 

100 105 HO 

Tyr He Gin Cys He Glu Lys Leu He Glu Gly Cys Phe Glu Glv Tvr 

115 120 125 

Asn Ala Thr Val Phe Ala Tyr Gly Gin Thr Gly Ala Gly Lys Thr Tvr 

130 135 140 

Thr Met Gly Thr Gly Phe Asp Val Asn He Val Glu Glu Glu Leu Glv 
1" 150 155 160 

He He Ser Arg Ala Val Lys His Leu Phe Lys Ser He Glu Glu Lys 

165 170 175 

Lys His He Ala He Lys Asn Gly Leu Pro Ala Pro Asp Phe Lys Val 
15 180 185 190 

Asn Ala Gin Phe Leu Glu Leu Tyr Asn Glu Glu Val Leu Asp Leu Phe 

195 200 205 

Asp Thr Thr Arg Asp He Asp Ala Lys Ser Lys Lys Ser Asn He Arq 

210 215 220 

He His Glu Asp Ser Thr Gly Gly Asn Leu Tyr Cys Gly Arg Phe Gin 
225 230 235 240 

His Val Leu 



10 



20 



35 



45 



25 <210> 20 

<211> 223 
<212> PRT 
<213> Homo sapiens 

30 <400> 20 

Gly Asn Tyr Arg Pro Ser Lys Lys Arg Phe Arg Glu Gly Lys Glu Lys 

15 10 15 

Lys Arg Gly Lys Lys Val Trp Leu Val Lys Arg He He Gin Thr Leu 

20 25 30 

Thr Lys Arg Arg Lys Lys Lys Arg Val Phe Arg Lys Glu Lys Pro Asn 

35 40 45 

Glu Leu Glu Val Glu Glu Ser Gin Glu Val Ser Asp His Glu Asp Glu 

50 55 60 

Glu Glu Glu Glu Glu Glu Glu Glu Asp Asp He Asp Gly Gly Glu Ser 
40 65 70 75 80 

Ser Asp Glu Ser Asp Ser Glu Ser Asp Glu Lys Ala Tyr Gin Ala Asp 

85 90 95 

Leu Ala Asn He Thr Cys Glu lie Ala He Lys Gin Lys Leu lie Asp 

100 105 HO 

Glu Leu Glu Asn Ser Gin Lys Arg Leu Gin Thr Leu Lys Lys Gin Tyr 

115 120 125 

Glu Glu Lys Met Met Leu Gin His Lys He Arg Asp Thr Gin Leu Glu 

130 135 140 

Arg Asp Gin Val Leu Gin Asn Leu Gly Ser Val Glu Ser Tyr Ser Glu 
50 145 150 155 160 

Glu Lys Ala Lys Lys Val Arg Ser Glu Tyr Glu Lys Lys Leu Gin Ala 

165 170 175 

Met Asn Lys Glu Leu Gin Arg Leu Gin Ala Ala Gin Lys Glu His Ala 
180 185 190 

55 Arg Leu Leu Lys Asn Gin Ser Gin Tyr Glu Lys Leu Lys Lys Leu Gin 
195 200 205 

Gin Asp Val Met Glu Met Lys Lys Thr Lys Val Arg Leu Met Lys 
210 215 220 

60 <210> 21 

<211> 206 
<212> PRT 

<213> Homo sapiens 

65 <400> 21 

Met Asn Lys Pro He Thr Pro Ser Thr Tyr Val Arg Cys Leu Asn Val 
15 10 15 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



Gly Leu lie 

Lys Lys Leu 
35 

Asn Gin Phe 
50 

Ser Pro Thr 
65 

Val Gly Asp 

Ala Ser Leu 

Pro Ser Lys 
115 

Asp Lys Asp 
130 

Tyr Met Pro 
145 

Ser Asp Thr 

Thr Asn Asn 

Gly Glu Gly 
195 



Arg Lys 
20 

Ala Val 

His lie 

Ser Glu 

Leu Val 

85 
Leu Leu 
100 

Glu Ala 

Arg Thr 

Pro Asp 

Arg Phe 
165 
Phe Asp 
180 

Gly Phe 



Leu Ser 

Ala He 

Arg Arg 

55 
Leu Leu 
70 

Asp Leu 

Pro Asp 

He Thr 

Leu Met 
135 
Ser Ser 
150 

His Ser 
Glu Arg 
Gly Val 



Asp Phe 

25 
Lys Lys 
40 

Phe Glu 

Phe Asp 

Leu He 

Ala Val 
105 
Val Gin 
120 

Thr Pro 

Ser Pro 

Phe Ser 

Pro He 
185 
Val Tyr 
200 



He Asp 

Pro Ser 

Ala Leu 

Trp Gly 

75 
Gin Asn 
90 

Pro Lys 

Gin Lys 

Val Gin 

Glu Asn 
155 
Phe Tyr 
170 

Ser Val 
Lys Gly 



Pro Gin 

Gly Asp 

45 
Leu Gin 
60 

Thr Thr 

Glu Phe 

Thr Ala 

Gin Met 
125 
Asn Leu 
140 

Lys Ser 

Glu Leu 

Gly Gly 

Leu Arg 
205 



Glu Gly Trp 

30 

Asp Arg Tyr 

Thr Gly Lys 

Asn Cys Thr 
80 

Phe Ala Pro 
95 

Asn Thr Leu 
110 

Pro Phe Cys 

Glu Gin Ser 

Leu Glu Val 
160 

Lys Asn Val 

175 
Asn Lys Met 
190 
Lys 



<210> 22 

<211> 1260 

<212> DNA 

<213> Homo sapiens 



<400 
cttctccgca 
actcttaagg 
aagatcactg 
aagtctctgg 
atgatggatc 
gattattctg 
gaaggggaga 
cctcagatcg 
attcttacgt 
ggatgtaatc 
cccagcagct 
agtggtgtga 
aatgatagtg 
atcaagctaa 
tccatgttga 
ggcattgaga 
agcgttatca 
accctgtggg 
aaatttaaaa 
cacagtttgc 
cttggtttaa 



> 22 
cgactgttac 
aaaaactcat 
tagtgggtgt 
ctgatgaact 
tgcagcatgg 
tgaccgccaa 
gtcggctcaa 
tcaagtacag 
atgttacctg 
tggattctgc 
gccatggatg 
atgtggcagg 
aaaattggaa 
aaggatatac 
aaaatctatc 
atgaagtctt 
accagaagct 
acatccagaa 
actacaatgt 
tttgatcttc 
tgcttgcaat 



agaggtctcc 
tgcaccagtt 
tggacaagtt 
tgctcttgtg 
gagcttattt 
ttctaagatt 
tctggtgcag 
tcctgattgc 
gaaactaagt 
tagatttcgc 
gatt ttgggg 
tgtttctctc 
ggaagtgcat 
caactgggct 
caggattcat 
cctgagcctt 
aaaggatgat 
ggacctaaaa 
gattaactcg 
ttcaatatgt 
ctgagctctt 



agagccttct 
gcggaagaag 
ggtatggcgt 
gatgttttgg 
cttcagacac 
gtagtggtaa 
agaaatgtta 
atcataattg 
ggattaccca 
taccttatgg 
gaacatggcg 
caggaattga 
aagatggtgg 
attggattaa 
cccgtgtcaa 
ccatgtatcc 
gaggttgctc 
gacctgtgac 
agcctttagt 
gaatttgggc 
gaacaaataa 



ctctcctgtg 
aggcaacagt 
gtgctatcag 
aagataagct 
ctaaaattgt 
ctgcaggagt 
atgtcttcaa 
tggtttccaa 
aacaccgcgt 
ctgaaaaact 
actcaagtgt 
atccagaaat 
ttgaaagtgc 
gtgtggctga 
caatggtaaa 
tcaatgcccg 
agctcaagaa 
tagtgagctc 
tttcatccat 
tcacagaatc 
aattaactat 



caaaatggca 
tccaaacaat 
cattctggga 
taaaggagaa 
ggcagataaa 
ccgtcagcaa 
attcattatt 
cccagtggac 
gattggaagt 
tggcattcat 
ggctgtgtgg 
gggaactgac 
ctatgaagtc 
tcttat tgaa 
ggggatgtat 
gggattaacc 
aagtgcagat 
taggctgtag 
gtacatggat 
aaagcctatg 
tgtagtgtga 



<210> 23 

<211> 3151 

<212> DNA 

<213> Homo spaiens 



<400 
taacacagtt 
tatgcagaaa 
cctccat cag 
taccaggtgt 
cggcgccgtg 
agcatcccca 
gactttgact 
gataccattg 



> 23 
gtgaaaagag 
cgctaatgga 

ggtgggaaga 

gcaacgtgtt 
gcgcccaccg 
gcgtgcctgg 
cggccaccaa 
cagccgacga 



atggatgtgg 
ctccactaca 
ggtgagtggc 
tgagtcaagc 
catccacgtg 
ctcctgcaag 
gaccttcccc 
gagcttctcc 



gttccagtcc 
gcgactgctg 
tacgatgaga 
cagaacaact 
gagatgaagt 
gagaccttca 
aactggatgg 
caggtggacc 



tagccctgcc 
agctgggctg 
acatgaacac 
ggctacggac 
tttcggtgcg 
acctctatta 
agaatccatg 
tgggtgaccg 



tgtgtgcact 
gatggtgcat 
gatccgcacg 
caagtttatc 
tgactgcagc 
ctatgaggct 
ggtgaaggtg 
cgtcatgaaa 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
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atcaacaccg 
caggactatg 
cgcatcatcc 
ctggtggctg 
ctctactgta 
ggcttcgagg 
gccaaccaag 
ggggccacca 
atgccctgca 
tccctcatgc 
aacatcatct 
gtacagtacg 
ctggcccaca 
cccttctcgc 
gtgtccatca 
ccagaccagc 
ctcagtgagt 
ctcaaagccg 
cgctacagcg 
caggagaagt 
gtggttgtca 
acggacaagc 
gatcctttca 
atctcctgtg 
ggccacctga 
ggctacacgg 
gaccatccca 
atcaccgagt 
ttcacagtca 
gcagacatga 
ctggtctgca 
cccacctaca 
at ccagtacc 
gaggtgatgt 
gccattgagc 
ctcatgctgg 
aacacgctag 
tctggcatca 
gtggacgagt 
ggcttcacct 
gtcactttgg 
atgaaccaga 
caagccccgc 
cagccact eg 
tcaccaagaa 



aggtgeggag 
geggctgeat 
agaatggege 
cccggggcag 
aeggggaegg 
ccgttgagaa 
gggatgaggc 
actgtgtctg 
caaccatccc 
tggagtggac 
gcaagagctg 
caccacgcca 
cccagtacac 
ctcagttcgc 
tgcatcaggt 
ccaatggcgt 
acaacgccac 
gcgccatcta 
gcaagatgta 
tgccactcat 
tcgccatcgt 
tgcaacacta 
cctacgagga 
tcaaaattga 
agetgecagg 
agaagcagcg 
acgtcatcca 
tcatggagaa 
t ccagctggt 
actatgttca 
aggtgtcgga 
ccagtgccct 
ggaagttcac 
cctatgggga 
aggactatcg 
actgttggca 
acaagatgat 
acctgccgct 
ggctgaaggc 
ectttgaegt 
ctggccacca 
ttcagtctgt 
cccctctgcc 
ccaggaggcc 
aacatgeaac 



cttcggacct 
gtccctcatc 
catcttccag 
ctgcatcgcc 
cgagtggctg 
tggcaccgtc 
ctgtacccac 
ccgcaatggc 
ctccgcgccc 
ccctccccgc 
tggcteggge 
getaggectg 
cttcgagatc 
ctctgtgaac 
gagccgcacc 
gatcctggac 
agecataaaa 
tgtcttccag 
ct tccagacc 
catcggctcc 
gtgtaacaga 
caccagtggc 
ccccaacgag 
gcaggtgatc 
caagagagag 
ccgggacttc 
cctggagggt 
tggctccctg 
gggcatgett 
ccgtgacctg 
ctttgggctc 
gggcggaaag 
ctcggccagt 
gcggccctac 
gctgccaccg 
gaaggacege 
ccgc'aatccc 
gctggaccgc 
catcaagatg 
cgtgtctcag 
gaaaaaaatc 
ggaggtttga 
ccacgtgccg 
aegggcaegg 
t caaacgacg 



gtgtcccgca 
gccgtgcgtg 
gaaaccctgt 
aatgcggaag 
gtgcccatcg 
tgccgaggtt 
tgtcccatca 
tactacagag 
caggctgtga 
gaetceggag 
eggggtgect 
accgagccac 
caggctgtga 
atcaccacca 
gtggacagca 
tatgagctgc 
agccccacca 
gtgegggcac 
atgacagaag 
tcggccgctg 
cgggggtttg 
cacatgaccc 
gcagtgcggg 
ggagcagggg 
atctttgtgg 
ctgagcgaag 
gtcgtgacca 
gactcctttc 
eggggcateg 
gctgcccgca 
tcacgctttc 
atcccca tec 
gatgtgtgga 
tgggacatga 
cccatggact 
aaccaccggc 
aacagcctca 
acgatccccg 
gggcagtaca 
atgatgatgg 
ctgaacagta 
cattcacctg 
gccctcctgg 
gaagaaccaa 

g 



geggcttcta 
tcttctaccg 
egggggctga 
aggtggatgt 
ggegctgeat 
gtccatctgg 
acagccggac 
cagacctgga 
tttccagtgt 
gecgagagga 
gcacccgctg 
gcatttacat 
aeggegttae 
accaggcagc 
ttaccctgtc 
agtactatga 
acacggtcac 
gcaccgtggc 
ccgattacca 
gectggtett 
agcgtgctga 
caggcatgaa 
agtttgccaa 
agt ttggcga 
ccatcaagac 
cctccatcat 
agagcacacc 
teeggcaaaa 
cagctggcat 
acatcctcgt 
tagaggacga 
gctggacagc 
getaeggcat 
ccaaccagga 
gcccagctgc 
ecaagttegg 
aagccatggc 
actacaccag 
aggagagctt 
aggacattct 
tccaggtgat 
cctcggctca 
tgctctatcc 
gcggtgccag 



cctggccttc 
caagtgcccc 
gagcacatcg 
acccatcaag 
gtgeaaagea 
gactttcaag 
cacttctgaa 
ccccctggac 
caatgagacc 
cctcgtctac 
eggggacaat 
cagtgacctg 
tgaccagagc 
tccatcggca 
gtggtcccag 
gaagcaggag 
cgtgcagggc 
aggctaeggg 
gacaagcatc 
cctcattgct 
cteggagtae 
gatctacatc 
ggaaattgac 
ggtctgcagt 
gctcaagtcg 
gggccagttc 
tgtgatgatc 
cgatgggcag 
gaagtacctg 
caacagcaac 
tacctcagac 
cccggaagcc 
tgtcatgtgg 
tgtaatcaat 
cctgcaccaa 
ccaaattgtc 
gcccctctcc 
ctttaacacg 
cgccaatgcc 
ccgggttggg 
gcgggcgcag 
cctcttcctc 
actgeaggge 
ccacgagacg 



<210> 24 

<211> 1234 

<212> DNA 

<213> Homo sapiens 



<400 
tagttcaaga 
aggctttcaa 
tggccaacgc 
agaagatcgt 
get tccagca 
atgactttca 
gagttatcaa 
aggaagttct 
gtcataatcg 
tcatcgatga 
gcgccaactg 
gcctcccatg 
aagggcgcgt 
agtgtgagaa 
aaactcacta 



> 24 
caacagagac 
agataattcg 
cctggccagc 
gaacagtaat 
gttcccagaa 
gatgetcttt 
agecatgaat 
ggcagatatc 
tgagaaagee 
gcagcctctg 
egggaaggag 
ccatgataaa 
ggtgaacgct 
accctttctt 
taaccagcta 



aaagctaaga 
cagtgatgtg 
gccacttgcg 
ggggagctgt 
ggactct tct 
gccccttgct 
aacagctggc 
gggtttgtca 
agaggecttg 
atattcaaga 
ctgactgccg 
atgggggtcc 
atgggcaagc 
ggacatcgcc 
tttggtgatg 



tgaggaagtt 
aaactggcct 
agegctgeaa 
accatgagca 
atgagtttga 
gtcatcagtg 
ateeggagtg 
agaatgetgg 
ggaaatacat 
acgaccccta 
atgcacggga 
ccatctgtgg 
agtggcatgt 
attatgagag 
tttgettcca 



ctgtacagtt 
cccaagccct 
gggcggcttt 
gtgtttcgtg 
aggaagaaag 
tggtgaattc 
cttccgctgt 
gagacacctg 
ctgccagaaa 
ccatccagac 
gctgaaaggg 
tgettgeega 
ggagcatttt 
gaaaggcctg 
ctgcaatcgt 



taggaaatag 
gataacaaca 
gcgcccgctg 
tgcgctcagt 
tactgtgaac 
atcat tggee 
gacctctgcc 
tgtcgcccct 
tgecatgeta 
cat ttcaact 
gagctatact 
cggcccatcg 
gtttgtgcca 
gcatattgtg 
gttatagaag 



540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
-1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3151 
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gtgatgtggt 
gcaacactaa 
agaagtgcta 
ttaggaagga 
attacttgtc 
acctatatga 



ctctgctctt aataaggcct ggtgcgtgaa ctgctttgcc tgttctacct 
attaacactc aagaataagt ttgtggagtt tgacatgaag ccagtctgta 
tgagatttcc attggagctg aagaaaagac ttaagaaact agctgagacc 
aataagttcc tttatttttt cttttctatg caagataaga gattaccaac 
ttgatctacc catatttaaa gctatatctc aaagcagttg agagaagagg 
atggttttat gtcatttttt taaa 



960 
1020 
1080 
1140 
1200 
1234 



<210> 25 
<211> 4534 
<212> DNA 

<213> Homo sapiens 



<400> 25 
gtcacgagcg tcgaagagac 
gggcttgttg gtgccccagc 
gccgccgccg cctcgcaaca 
cgcgaccctc cggggcacct 
cggttcgggt ccaagatgat 
cacttcacag aagttccagt 
aaagaacccg gcgagagtga 
gagtttcatc atgttggcca 
ggcctcccaa agtgctggat 
aaagaacgtc cagttgcgga 
cagaggaacg aagttcgctt 
agtggaccaa gatctcagga 
tatcgaagaa aggagaacgg 
caggaaatgg catctcgcca 
aaggaacagc gcttaaagtt 
gagcaggaga aacttaaaag 
aaagtgagag cacttaaagg 
gaggaaattg aacagatgaa 
gtgtcaaaag tagaagaact 
agccaccatg acaatcagtc 
ctaagaaaca aattgaatca 
aataagcgta attcagaagt 
ctgtggaaga agaaggcagc 
aatcttcccc agcaagccgc 
cagtcatcta ctatgcctcg 
ccggatggtt ccttggtcat 
aacatgagat ctggggctgc 
gattggagtc cttcaaatgc 
agcactggga atgctctgga 
aagaaagtgc gtccgttctc 
tttggtactc tgaggaagaa 
aataaaaatg tggctaaagt 
ccttattttg gacaaactaa 
cagttgtcaa cagttgttcc 
agagtgctgc tatctcccag 
tctaagcaag aaagtccacc 
gacaccttac ttccaccctt 
atgtatacgc aacagcaggc 
accaagactc ataccagagg 
gctgcccaga atcaacagca 
ggcagtccag aacctgaaac 
agaattcctc ggccactcag 
aaccagagtg atgctgacct 
ctaaagaaac gtagttctat 
cttttatatc agaggaccac 
tccaagtcag cttctgtgac 
ttacatgtgg agcccgaaaa 
gatgtgggga atgccagtac 
tatgagcctg agggagtccc 
aatccagagg ctccacatgt 
ccaccatacc catctgggga 
cctgaaatca ccgggcaggt 
ggctcagagc gtatcgctca 
gattcgtctt tggagggaga 



aaagccgcgt 
ccgcgcggag 
ggtccgggcg 
cagctcggcc 
gccgatgttt 
tactccagaa 
ttgccatttg 
ggatggtctc 
tacaggtgtg 
taatgagcga 
cttccttcgt 
tccaagttta 
tgttaatagt 
gcagcaacag 
tttgaaacaa 
gctaaaagaa 
ccacgtggaa 
taatttgttc 
gaccaggcag 
tgcagtggct 
agagcagaat 
ggcagtcatg 
tctacagcaa 
gtcagcccca 
gatgccctca 
tcaggcttca 
ttcacaaact 
agatcttttc 
tcaagttgat 
aatgtttgat 
ccagagcagt 
accacctcct 
tcagccacct 
gtccatggga 
cataccttcg 
tgctgctgcc 
cagaaaaccc 
gccaggaaaa 
gccacacttt 
gcacccagag 
agagcctgtt 
cccaactaaa 
agaagcctta 
tacagagcca 
catagcggcc 
tgccagctca 
ggaggtggtc 
agagaacagt 
agacaacagc 
gcttgatgtg 
gcctgaaggg 
ctctctgcct 
tggaatgagg 
atttgacctt 



cagggggccc 
ggcccttcgg 
gcctcgctct 
ggggccgcag 
cttaccgtgt 
acaatatgca 
gctgaagtgt 
gatctcctga 
agccaccacg 
atgtttgatg 
catgaacgcc 
aaaagaaatg 
cctaggatgg 
attgaagccc 
caagatcagc 
atagctgaga 
cagaagagac 
cagcaaaaac 
ctagagatgc 
gagcttgatc 
gccaagctac 
gataagcgtg 
aaagaaaatc 
agccgtgtgg 
aggcctgaat 
gaggggccga 
aaaggctcta 
ccaagccaag 
gatggagagg 
gcagtagacc 
gaagatatct 
gttoctacaa 
tcagacatta 
actaaaccaa 
gt tggccaag 
gt ccggccct 
cagaccgtgg 
aacttccagc 
tcaagtgtat 
aacatttatt 
tcttcagttc 
ttactgcctt 
cgaaagaaac 
gagggtccta 
atggagacca 
gaaagcccag 
tctctggttc 
gacatgccag 
ccaaatctcc 
tacctggagg 
cccggagaag 
cctggtaaaa 
gtgaaattca 
gtacagagaa 



ggccggggcg 
acccgcgcgc 
ccgctcccct 
tctggccacc 
at ctcagtaa 
gagacgtggt 
ggtgtggctc 
ccttgtgatc 
atcagcctct 
t tct tcaacg 
cccctggcag 
gtgtaaaagt 
atctgactct 
agcaacaat t 
gacaacagca 
atcaggaagc 
taagcaatgg 
agagggagct 
tcaagaacgg 
gcctctataa 
aacaacagag 
t taatgagct 
taccagtttc 
ctgcagtagg 
tgctggtgaa 
tgaaaataca 
aaatccatcc 
gctctgcttc 
t tccgctgag 
agtccaatgc 
tgcgggatgc 
aaccaaaaca 
agccagacgg 
aaccagcagg 
accagaccct 
ttactcccca 
cagcaagttc 
aggctgtgca 
atggtaagcc 
ccaatagcca 
aggagaacca 
tcttatctaa 
tgtctaacgc 
atgggccaaa 
tctctgtccc 
tagaaatcca 
ctgaatcatt 
ctccttctcc 
agaataaccc 
agtaccctcc 
actcggtgag 
ggacaaactt 
acccccttgc 
ttatttatga 



ggggagcccg 
cgccgctgcc 
cccccgcatc 
cgcttccatg 
caatgagcag 
ggatctgtgc 
tgtagagata 
cgcctgcct c 
agtgtttaaa 
att tggaagt 
ggacattgtg 
tcctggtgaa 
tgctgaactt 
gctggcaact 
acaagttgct 
taagctaaaa 
gaaacttgtg 
cgtcctggct 
caggatcgac 
ggagctgcag 
ggagtgtttg 
gagggaccgg 
atctgatgga 
tccctatatc 
gccagccctg 
gacactgccc 
agttggccct 
tgtacctcaa 
ggagaaagag 
cccaccttcc 
tcaggttgca 
gattaatttg 
aagttctcag 
gcagcagccg 
ttctccaggt 
gcct tccaaa 
aatatattcc 
gagcgcgttg 
tgtaattgct 
gggcaagcct 
tgaaaacgaa 
tccttaccga 
accaaggcct 
tattcagaag 
atcataccca 
gaatccatat 
gtccccagag 
aggccttgat 
agaagaacca 
atacccaccc 
catgcgcccg 
gcgtaaaact 
tt tactgcta 
ggttgatgac 
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1800 
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2100 
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2280 
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ccaagcctgc 

acagaaatcg 

ggatggactc 

gtggagtcag 

aagtgcgagg 

gagaagatgg 

aatgatgatg 

gaagatgaaa 

aacttgctgg 

cacacagaat 

tttggcaaaa 

tgtctctaga 

aaataaagca 

ctcagataag 

tcatttattg 

tcctaaacca 

ccccggcacc 

attgtcagtg 

tgtagcatct 

aattagatgt 

t ctcagaaat 

ctttaaaaac 



ccaatgatga 
ttaagttcct 
cattacattg 
gagccgctgt 
aaatggagga 
gcataatgaa 
agctgcccat 
tcgaatggtg 
gactgtaccc 
tttagtcaat 
atttcacaag 
agagaatgaa 
tctacgtcag 
tatacagcga 
ggcagttggg 
gtcctcctgc 
agcaggcctt 
aagaacgtta 
taagaattat 
agaatcttgt 
tagaaacttg 
agttgtaata 



aggcatcacg 

ggtacagttt 

tgctgcctca 

gtttgccatg 

aggctacact 

taaaggagtc 

gaaagaagga 

gtgggcgcgc 

aagaattaaa 

gaagaattaa 

acttatttta 

ggattgaaga 

caggccatac 

caatcctgtt 

aaatcagctc 

cagaaaggac 

ggggctcact 

gttggttgtc 

acatatgttt 

t tgtaggctg 

ctacagactt 

aactttttga 



gctcttcaca 

ggtgtaaatg 

tgtaacaacg 

acctacagtg 

cagtgctccc 

att tatgcgc 

gactgcatga 

cttaatgata 

ccaagacaaa 

tctctgttaa 

atgacaatgt 

attcaccatt 

tgtgttgggg 

ttctacaaga 

tctgtcctgt 

cagtgccgtc 

gaaggctcga 

agtgaacaat 

gaaatattga 

aattttaatc 

acccgtaata 

tgct 



atgctgtgtg 

taaatgctgc 

tccaagtgtg 

acatgcagac 

aatttcttta 

tttgggatta 

caatcatcca 

aggagggata 

ggagcttggc 

gaagaagtaa 

agcttgaaag 

agaggacatt 

caaaggtgtc 

atcctgtcta 

tgagtgtttt 

acatcgctgt 

aggcactgca 

aactttatta. 

aactaagcta 

tgtatttatt 

tttgtcaaga 



tgcaggccac 

tgatagtgat 

taagtttttg 

tgctgcagat 

tggagt tcag 

tgaacctcag 

cagggaagac 

tgttccacgt 

ctgaaacttc 

tacgattat t 

cgatgaagaa 

tagcgtgatg 

ccgtgtagca 

gtaaatagga 

cagcagctgc 

ctctgattgt 

caccttgtat 

tatgagtttt 

cagtaccagt 

gtcttttgta 

tea tagctga 
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<211> 4660 
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<400 
ggggcttaga 
gaetagagee 
ctcaagattt 
tgaagttttt 
cacataagag 
gggagcaaga 
acattgacag 
agatacagct 
aatcagacat 
ttttggaggc 
tgtcagagaa 
tctacagcat 
cactagtaaa 
actgeaggat 
agcgttatga 
gagaagattt 
ttacgggtgt 
atactagttc 
tttatgcttt 
aagtgatgac 
caatgettag 
ctttcatcat 
acactaattc 
cagctcatgg 
ccagaataaa 
acagecaagg 
accaaggaca 
tggaaggtac 
gattggcaac 
gctctcccgt 
tcccagtaac 
ccagttctcc 
aaccagcaaa 
ttcaatctga 
atgacagact 
ttttgacaac 
atgtcctgtc 
gtccctcttc 



> 26 
aat taacagg 
atctctggaa 
gaccatatct 
caacatgagt 
gaaaggatcg 
aaataaatat 
cttgagtgta 
aatgaagaga 
ctcatcaagt 
tttggatgga 
tgtaaccagc 
actgeaegtg 
tggagttcct 
gctaattcac 
agtaatgcag 
ccagtcatgt 
agaatccttt 
cctgagagct 
tttccaacct 
tcgtggcact 
cgcccacacc 
gggaattcat 
tggaatgtca 
tgtggctcgt 
ccgccagcag 
aagtttcgga 
ggccagttca 
aggaatatcc 
aaggeccagg 
tggcatgaca 
atctttacag 
agtcctcagg 
agctgagtcc 
caacagctct 
ttcagatgga 
aactgecgaa 
ttgeacagge 
tcatagctca 



ttgtttatat 
aacatcatta 
gttttgagga 
ggcctcgggg 
ccatgtgaca 
ttagaagaac 
aaaccagaca 
atggaacaag 
agtcaaggag 
tttttctttg 
tacttaggtt 
ggggatcatg 
tggcctcaag 
cctccagatg 
tgtttcactg 
ctgatttgta 
atgaccaagc 
gctggcagaa 
cagggcagag 
gcctccagcc 
aagtgtaaac 
atcatcgaca 
at tccccgag 
tcatccacat 
agctcagacc 
tgctcacccg 
cagagcagta 
ctagcacagt 
atgecaaaca 
agtagtgcct 
ggtatgaatg 
cagatgagct 
aaagataaca 
agtgatggca 
gacagtaaat 
cagcagttac 
acttccaact 
ttgacagaac 



aattggcctt 
tcccat tccc 
ttcattatga 
acagttcatc 
cactggcatc 
tagctgagtt 
aatgeaagat 
agaaatcaac 
tgatagaaaa 
ttgtgaactg 
acaatcagga 
cagaatttgt 
aggcaacacg 
agecagggae 
tgtcacagcc 
ttgcacggcg 
aagatactac 
ctggttggga 
aaccatctta 
cctcctatag 
tttgctaccc 
gggagcacag 
taaatccctc 
tgccaccatc 
ttcatagcag 
gaagtcagat 
atccctcttt 
tcatgtctcc 
attcctttcc 
gtaataataa 
aaggacccaa 
cacagaattc 
aagagattgc 
aacctctgga 
actctcaaac 
ggcatgctga 
ctgcctctgc 
ggcataaaat 



aaatgaggtg 

egggaaget a 

acaaagaagt 

cgaccctgct 

aagcacggaa 

actgtctgcc 

tttgaagaaa 

aactgatgac 

ggaatccttg 

tgaagggaga 

ggaattaatg 

gaagaatctg 

acgaaat age 

cgagaaccaa 

aaaatcaatt 

attacctegg 

aggtaaaatc 

agatttagtg 

tgccagacag 

attcatattg 

tcaaagtcca 

tgggctttct 

ggtcaatcct 

caacagcaac 

cagtcatagt 

tgtagccaat 

aaacctcaat 

aaggagacag 

tcctaatatt 

taaccgatct 

taactccgtt 

acctagcaga 

ctcaatttta 

ttcagggctt 

cagtcacaaa 

tatagacaca 

taactcttca 

tctacaccgg 



agagtgaaga 
ccct ctggaa 
ctcccaggtg 
aacccagact 
aagaggegea 
aaca t tagtg 
acagtcgatc 
gatgtacaga 
ggaccccttc 
attgtatttg 
aataccagcg 
ctaccaaaat 
cataccttta 
gaagcttgee 
caagaggatg 
cctccagcta 
atctctattg 
aggaagtgea 
ctgttccaag 
aatgatggga 
gaeatgeaac 
cctcaagatg 
agtatctctc 
atggtatcca 
aattctagca 
gttgccttaa 
aattctccta 
gttacttctg 
tcgacattaa 
tattcaaaca 
ggcttctctg 
ttaaatatac 
aatgaaatga 
ctgea taaca 
etagtgeage 
agetgeaaag 
ggaggttctt 
ctcttacagg 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
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agggtagccc 

ctacttctgt 

atgcttcaaa 

atgagaaaga 

aaaagaaaga 

ctgaggaaat 

agttactgcc 

tggatggtgc 

ccgccactgc 

ttgataacca 

tgacagctat 

ttctctgtcc 

tggtatcctt 

gaattgacaa 

aacaagcaac 

cttctccttc 

cttcactggg 

tgggcatgca 

ttcaactaca 

tcttaaacca 

agcaaattac 

tgtacagtca 

ggcagcaaag 

acccccgtct 

atccaggaac 

catcct tggc 

agtttggcac 

gaatggttcc 

tgccgatgcc 

ggtatcagtc 

tcagtcaagc 

gcatcaccgt 

cccagatgca 

taaatgatcc 

ttctcaaaac 

agtgtacagt 

ctcaaaagcc 

agctactgac 

gatatacaaa 

ttcaggtcgt 



ctcagatatc 

gtcagtgact 

gaaaaaagaa 

tttaagatca 

acagatggat 

aaaactggag 

cacgctggag 

ggtcaccagt 

cagacccact 

atttggacaa 

aaatcagagt 

acccacaaca 

ccttagtggc 

acttgttcag 

gccacctttg 

tcctactgcc 

gacgatgcct 

gcccaggcaa 

gcagcgatta 

gtttgcagca 

acctcagcca 

acagcaccga 

ctttgggaac 

t cctcagggt 

cccacctgct 

caaccgcaac 

tggaatcaat 

ccaaggtgag 

aatccctcct 

accagacatg 

tgtccagaac 

ttccatggca 

gatgagctct 

cgcactgaga 

agaagcagat 

gaatctggta 

cacgtcagga 

tgaataacca 

tatattatat 

agcatttgga 



accactttgt 
ggacaggtac 
tcaaaagacc 
actccaaacc 
ccatgtaata 
gcccagagcc 
aaggcagcac 
gtaaccatca 
tccaggctaa 
ccaggaacag 
aaatcagaag 
gtagaaggga 
aaagatgaaa 
gggggtggat 
atcatggaag 
aatctcccta 
gttcaagtaa 
actctaaaca 
cagggacaac 
actgctcctg 
cccctgaatg 
cagaggcagc 
aacctccctc 
gctccacagc 
tctaccagcc 
agcatggtga 
cctcagatgc 
gccaactttg 
cctcagagtt 
aaggcctggc 
cagcccacgc 
ggtggaaata 
t tg'cagatgc 
cacacaggcc 
ggaacccagg 
ggcggggacc 
ccacagaccc 
cttttaaagg 
atttttctga 
gcaaaaaaaa 



ctgtcgagcc 
aaggaaactc 
atcagctcct 
tgagcctgga 
caaacccaac 
agtttacagc 
agttgccagg 
aatcggagat 
atagattacc 
gcgatcagat 
accagtgtat 
gaaatgatga 
ctgagctagc 
tagatgtatt 
aaagacccaa 
gccctttcca 
cacctccccg 
gacctccggc 
agcagttgat 
ttggcatcaa 
ctcaaatgtt 
taatacagca 
cctcatctgg 
aattccccta 
cgttt tcaca 
gcagaggcat 
agcagaatgt 
ctccatctct 
ctctgctcca 
agcaaggagc 
ctgcacagcc 
cgaatgttca 
caggaatgaa 
tctactgcaa 
tgcaacaggt 
cttacctgaa 
cccaggccca 
aatgtgaaat 
gatttttgat 
aaaaaaaaaa 



tgataaaaag 
cagtataaaa 
acgctatctt 
tgatgtaaag 
cccaatgacc 
tgaccttgac 
cttatgtgag 
cctgccagct 
tgagctggaa 
tccatggaca 
tagctcacaa 
gaaggctct t 
tgaactagac 
atcagagaga 
cctttattcc 
aggcatggtc 
aggtgctttt 
tgcacctaac 
acaccaaaat 
tatgagatca 
ggcacaacgt 
gcaaagagcc 
actaccagtt 
tccaccaaac 
actagcagca 
gacaggaaac 
cttccagtat 
aagccctggg 
gcaaactcca 
gataggaaac 
aggagtatac 
gaacatgaac 
cactgtgtgc 
ccagctctca 
tcaggtgttt 
ccagcctggt 
gcagaagagc 
ttaaataata 
atctcaatct 



gacagtgcat 
ctagaactgg 
ttagataaag 
gtgaaagtgg 
aaacccact c 
cagtttgatc 
acagacagga 
tcacttcagt 
ttggaagcaa 
aataatacag 
ttagatgagc 
cttgaacagc 
agagctctgg 
tttccaccac 
cagccttact 
aggcaaaaac 
tcacctggca 
cagcttcgac 
cggcaagcta 
ggcatgcaac 
cagcgggaac 
atgcttatga 
caaatgggga 
tatggtacaa 
aatcctgaag 
ataggaggac 
ccaggagcag 
agctccatgg 
cct gcctccg 
aacaatgtgt 
aacaacatga 
ccaatgatgg 
cctgagcaga 
tccactgacc 
gctgacgtcc 
ccactgggaa 
ctccttcagc 
gacatacaga 
gcagccat tc 



<210> 27 

<211> 6773 

<212> DNA 

<213> Homo sapiens 

<400> 27 

gcggctggtt gcgggccggc ggcgggctgg 
tggcttcacc agctacccct gggaccggga 
actcagatca aaacctgtta ctgaaggaac 
tctggatgct cttcttctgg aaaaagcaga 
ggaaaatata aaaatgaagc aagaggttga 
taaggagttg gaacaatcac atataaacta 
gttgatggca gtacgttcca aatacagtga 
agaagcaatg aatacgcaat tagaactttc 
agataatgtt aaaaaactac aagaagagat 
aattttatat ctgcaaaagc aattagacgc 
tcaactccaa aatatcattg aggctaattc 
gcaggaagag cttttacagt tgaaagctat 
ccagattgaa gcatcagcta aggaacatga 
agagaactta gtaaaacaat gtgaggcaag 
tgagttagaa aatttaagga aagccacctc 
tattctcttg caagaaaata catttgtaga 
agaagatacc ttaaaagaac ttgaatctca 
tatgaataat cttaagttaa aacttgaaat 
tcatgaacgg gaagacttag agtttaaaat 
gggctgtgta attgaaaaat taaaatctga 



cggagatgga 
aatct aagaa 
tggtgatatt 
gactgagcaa 
ggattctgta 
tgtgaaagaa 
agacaaagct 
agaacaactt 
tgagaaaatt 
taccactgat 
tcagcattac 
acaccaagaa 
agcagagata 
tgaaaagaac 
aaatgcaaac 
acaagtagta 
acacagtatc 
ggatgctcaa 
taatgaatta 
gctagcaggt 



ggatcttgtt 
t tggagaaag 
attaaggcat 
cagtgtcttt 
acaaagatgg 
at tgaaaatt 
aacttacaaa 
aaatttcaga 
aggccaggct 
gaaaagaagg 
caaaaaaata 
gaggtgaaag 
aataagttga 
atccagaaga 
caagacaatc 
aatgaaaaag 
ttaaaagatg 
catataaagg 
ttactagcta 
ttaaataaac 



caagatgggg 
aaattgaaga 
taactgaacg 
ctctgaaaaa 
gagatgcaca 
tgaaaaatga 
agcagctgga 
acaactctga 
ttgaggagca 
aaacagttac 
ttaatagttt 
agttgatgtg 
acgagctaaa 
aatatgaatg 
agatatgttc 
tcaaacactt 
aggtaactta 
atgagttttt 
aagaagaaca 
agttttgcta 



2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380 

4440 

4500 

4560 

4620 

4660 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
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tactgtagaa cagcataaca gagaagtaca gagtcttaag gaacaacatc aaaaagaaat 1260 

atcagaacta aatgagacat ttttgtcaga ttcagaaaaa gaaaaattaa cattaatgtt 1320 

tgaaatacag ggtcttaagg aacagtgtga aaacctacag caagaaaagc aagaagcaat 1380 

tttaaattat gagagtttac gagagattat ggaaatttta caaacagaac tgggggaatc 1440 

tgctggaaaa ataagtcaag agttcgaatc aatgaagcaa cagcaagcat ctgatgttca 1500 

tgaactgcag cagaagctca gaactgcttt tactgaaaaa gatgcccttc tcgaaactgt 1560 

gaatcgcctc cagggagaaa atgaaaagtt actatctcaa caagaattgg taccagaact 1620 

tgaaaatacc ataaagaacc ttcaagaaaa gaatggagta tacttactta gtctcagtca 1680 

aagagatacc atgttaaaag aattagaagg aaagataaat tctcttactg aggaaaaaga 1740 

tgattttata aataaactga aaaattccca tgaagaaatg gataatttcc ataagaaatg 1800 

tgaaagggaa gaaagattga ttcttgaact tgggaagaaa gtagagcaaa caatccagta 1860 

caacagtgaa ctagaacaaa aggtaaatga attaacagga ggactagagg agactttaaa 1920 

agaaaaggat caaaatgacc aaaaactaga aaaacttatg gttcaaatga aagttctctc 1980 

tgaagacaaa gaagtattgt cagctgaagt gaagtctctt tatgaggaaa acaataaact 2040 

cagttcagaa aaaaaacagt tgagtaggga tttggaggtt tttttgtctc aaaaagaaga 2100 

tgttatcctt aaagaacata ttactcaatt agaaaagaaa cttcagttaa tggttgaaga 2160 

gcaagataat ttaaataaac tgcttgaaaa tgagcaagtt cagaagttat ttgttaaaac 2220 

tcagttgtat ggttttctta aagaaatggg atcagaagtt tcagaagaca gtgaagagaa 2280 

agatgttgtt aatgtcctac aggcagtcgg tgaatccttg gcaaaaataa atgaggaaaa 2340 

atgcaacctg gcttttcagc gtgatgaaaa agtattagag ttagaaaaag agattaagtg 2400 

ccttcaagaa gagagtgtag ttcagtgtga agaacttaag tctttattga gagactatga 2460 

gcaagagaaa gttctcttaa ggaaagagtt agaagaaata cagtcagaaa aagaggccct 2520 

gcagtctgat cttctagaaa tgaagaatgc taatgaaaaa acaaggcttg aaaatcagaa 2580 

tcttttaatt caagttgaag aagtatctca aacatgtagc aaaagtgaaa tccataatga 2640 

aaaagaaaaa tgttttataa aggaacatga aaacctaaag ccactactag aacaaaaaga 2700 

attacgagat aggagagcag agttgatact attaaaggat tccttagcaa aatcaccttc 2760 

tgtaaaaaat gatcctctgt cttcagtaaa agagttggaa gaaaaaatag aaaatctgga 2820 

aaaagaatgc aaagaaaagg aggagaaaat aaataagata aaattagttg ccgtaaaggc 2880 

aaagaaagaa ctagattcca gcagaaaaga gacccagact gtgaaggaag aacttgaatc ' 2940 

tcttcgatca gaaaaggacc agttatctgc ttccatgaga gatctcattc aaggagcaga 3000 

aagctataag. aatcttttat tagaatatga aaagcagtca gagcaactgg atgtggaaaa 3060 

agaacgtgct aataattttg agcatcgtat tgaagacctt acaagacaat taagaaattc 3120 

gactttgcag tgtgaaacaa taaattctga taatgaagat ctcctggctc gtaftgagac 3180 

attacagtct aatgccaaat tattagaagt acagatttta gaagtccaga gagccaaagc 3240 

aatggtagac aaagaattag aagctgaaaa acttcagaaa gaacagaaga taaaggaaca 3300 

tgccactact gtaaatgaac ttgaagaact tcaggtacaa cttcaaaagg aaaagaaaca 3360 

gcttcagaaa accatgcaag aattagagct ggttaaaaag gatgcccaac aaaccacatt 3420 

gatgaatatg gaaatagctg attatgaacg tttgatgaaa gaactaaatc aaaagttaac 3480 

taataaaaac aacaagatag aagatttgga gcaagaaata aaaattcaaa aacagaaaca 3540 

agaaacccta caagaagaaa taacttcatt acagtcttca gtacaacaat atgaagaaaa 3600 

aaacaccaaa atcaagcaat tgcttgtgaa aaccaaaaag gaactggcag attcaaagca 3660 

agcagaaact gatcacttaa tacttcaagc atctttaaaa ggtgagctgg aggcaagcca 3720 

gcagcaagta gaagtctata aaatacagct ggctgaaata acatcagaga agcacaaaat 3780 

ccacgagcac ctgaaaacct ctgcggaaca gcaccagcgt acgctaagtg cataccagca 3840 

gagagtgaca gcactacagg aagagtgccg tgctgccaag gcagaacaag ctactgtaac 3900 

ctctgaattc gagagctaca aagtccgagt tcataatgtt ctaaaacaac agaaaaataa 3960 

atctatgtct caggctgaaa ctgagggcgc taaacaagaa agggaacatc tggaaatgct 4020 

gattgaccag ctaaaaatca aattacaaga tagccaaaat aacttacaga ttaatgtatc 4080 

tgaacttcaa acattgcagt ctgaacatga tacactgcta gaaaggcaca acaagatgct 4140 

gcaggaaact gtgtccaaag aggcggaact ccgggaaaaa ttgtgttcaa tacagtcaga 4200 

gaacatgatg atgaaatctg aacatacaca gactgtgagt cagctaacat cccagaacga 4260 

ggtccttcga aatagcttcc gagatcaagt gcgacatttg caggaagaac acagaaagac 4320 

agtggagaca ttacagcagc agctctccaa gatggaagca cagctcttcc agcttaagaa 4380 

tgaaccgacc acaagaagcc cagtttcctc tcaacaatct ttgaagaacc ttcgagaaag 4440 

gagaaacaca gacctcccgc ttctagacat gcacactgta acccgggaag agggagaagg 4500 

catggagaca actgatacgg agtctgtgtc ttccgccagc acatacacac agtctttaga 4560 

gcagctgctt aactctcccg aaactaaact tgagcctcca ttatggcatg ctgaatttac 4620 

caaagaagaa ttggttcaga agctcagttc caccacaaaa agtgcagatc acttaaacgg 4680 

cctgcttcgg gaaacagaag caaccaatgc aattcttatg gagcaaatta agcttctcaa 4740 

aagtgaaata agaagattgg aaaggaatca agagcgagag aagtctgcag ctaacctgga 4800 

atacttgaag aacgtcttgc tgcagttcat tttcttgaaa ccaggtagtg aaagagagag 4860 

acttcttcct gttataaata cgatgttgca gctcagccct gaagaaaagg gaaaacttgc 4920 

tgcggttgct caaggtgagg aagaaaatgc ttcccgttct tctggatggg catcctatct 4980 

tcatagttgg tctggacttc gataggttga tggaaggaat atttttatta accaaataga 5040 

atctatttac aaaaatggtt cacgtatatt accacaattc ttttgtcaaa aagtgtgtat 5100 

atatgtttgc atctacatat atttgtacat ctatatgaca gatgtatttt aaaagtttca 5160 

tcttgaagta aaagtacaac agcttgaagt gttgatagca ggccacagcc ctctaactca 5220 
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tgtgatttcc catgcatgct gccagaataa aaccaccagg aatgaattca ctccccactt 5280 

ctctggaacc tcaggacccg cccatttctc ggcagtactg tgaattttga agttaaacta 5340 

aattttggta ccataccaac tggaatttag gctttaaaaa taatgtttca aggccaggtg 5400 

tggtgattca tgcctgaaat cccactactt tgggaggctg aggctggaga attgcttgag 54 60 

gctagtgagc tgtgactccc actgcactcc agctcgggga acagagcgag accttgtctc 5520 

-.^-.-^ 5580 

5640 

- - ttgtgct 5700 

aagttaaaat acagtttagt tatttgcttt aaaataaact cttctttttt tctttaaagt 5760 

10 atactatctc aaaactcatt atgttgtcag agccctagag ctggctagtg taacactgac 5820 

tatgagtagg tgggcccacc acttgagttg aggtgatttc atggtgtctt tccaggctct 5880 

tgatagggtg tcactgcatg caagccatga atctgttttg agaatcctct ccattttccc 5940 

aaataaaaac ctatcacaac agtgactata tcactcagca ttggatctaa atataaaagt 6000 

ggtgctttca gtgtttttgg cagatagtgt tccataagct ttccatcaga agggatttta 6060 

gacaccttag aggtccgtgc tacatcgtca cagttcctcc gaataacctt aggtggtagt 6120 

gttacttgcc tttgacacct ctgcatatgt tttaatgact agatccaaac tgtgttgttc 6180 

ttaaatcaaa aattggataa tttgtaatat ttatgtgtta atcacacagt atgctctctg 6240 

aagttctctt aagccttcag tttatactct taatttaatt ttctttctga gctggagaac 6300 

tggctttgca ctttggttac acagaacatt ggtttccaat tcagtttaac tgaaatttgc 6360 

20 tgctgatatg ttgagtttgt tctttaaaaa atagctcata tatctcatct ttcctcctgt 6420 

cttagaagaa cagacctaac tagtgaatgt attaatgaaa atgcatctat ttcagagctg 6480 

acatgaagag tttagttttt ttactttata aactgtgaat atgagtatgc cagctgcata 6540 

cgatgtaact aatcatattt aaatatattt cactttctct ttgactttag accttttgaa 6600 

gtctgtataa acttgttttg aaatatagtc tctgcttacg aatgtcataa caaaataatt 6660 

ttttgcatga taaaaaatta ctttgattac aaaaggcgta ttctttcatg gtttctgcaa 6720 

tgagaggaag tgtaatgatt attttaatat ttctattaaa tatgtttaac tgt 6773 



30 <210> 28 

<211> 2619 

<212> DNA 

<213> Homo sapiens 

35 <400> 28 

atggccacag cttgtaaaag atcaggagaa cctcagtctg acgacattga agctagccga 60 

atgaagcgag cagctgcaaa gcatctaata gaacgctact accaccagtt aactgagggc 120 

tgtggaaatg aagcctgcac gaatgagttt tgtgcttcct gtccaacttt tcttcgtatg 180 

gataataatg cagcagctat taaagccctc gagctttata agattaatgc aaaactctgt 240 

gatcctcatc cctccaagaa aggagcaagc tcagcttacc ttgagaactc gaaaggtgcc 300 

cccaacaact cctgctctga gataaaaatg aacaagaaag gcgctagaat tgattttaaa 360 

gatgtgactt acttaacaga agagaaggta tatgaaattc ttgaattatg tagagaaaga 420 

gaggattatt cccctttaat ccgtgttatt ggaagagttt tttctagtgc tgaggcattg 480 

gtacagagct tccggaaagt taaacaacac accaaggaag aactgaaatc tcttcaagca 540 

45 aaagatgaag acaaagatga agatgaaaag gaaaaagctg catgttctgc tgctgctatg 600 

gaagaagact cagaagcatc ttcctcaagg ataggtgata gctcacaggg agacaacaat 660 

ttgcaaaaat taggccctga tgatgtgtct gtggatattg atgccattag aagggtctac 720 

accagattgc tctctaatga aaaaattgaa actgcctttc tcaatgcact tgtatatttg 780 

tcacctaacg tggaatgtga cttgacgtat cacaatgtat actctcgaga tcctaattat 840 

ctgaatttgt tcattatcgg aatggagaat agaaatctcc acagtcctga atatctggaa 900 

atggctttgc cattattttg caaagcgatg agcaagctac cccttgcagc ccaaggaaaa 960 

ctgatcagac tgtggtctaa atacaatgca gaccagattc ggagaatgat ggagacattt 1020 

cagcaactta ttacttataa agtcataagc aatgaattta acagtcgaaa tctagtgaat 1080 

gatgatgatg ccattgttgc tgcttcgaag tgcttgaaaa tggtttacta tgcaaatgta 1140 

55 gtgggagggg aagtggacac aaatcacaat gaagaagatg atgaagagcc catccctgag 1200 

tccagcgagc tgacacttca ggaacttttg ggagaagaaa gaagaaacaa gaaaggtcct 1260 

cgagtggacc ccctggaaac tgaacttggt gttaaaaccc tggattgtcg aaaaccactt 1320 

atcccttttg aagagtttat taatgaacca ctgaatgagg ttctagaaat ggataaagat 1380 

tatacttttt tcaaagtaga aacagagaac aaattctctt ttatgacatg tccctttata 1440 

ttgaatgctg tcacaaagaa tttgggatta tattatgaca atagaattcg catgtacagt 1500 

gaacgaagaa tcactgttct ctacagctta gttcaaggac agcagttgaa tccatatttg 1560 

agactcaaag ttagacgtga ccatatcata gatgatgcac ttgtccggct agagatgatc 1620 

gctatggaaa atcctgcaga cttgaagaag cagttgtatg tggaatttga aggagaacaa 1680 

ggagttgatg agggaggtgt ttccaaagaa ttttttcagc tggttgtgga ggaaatcttc 1740 

aatccagata ttggtatgtt cacatacgat gaatctacaa aattgttttg gtttaatcca 1800 

tcttcttttg aaactgaggg tcagtttact ctgattggca tagtactggg tctggctatt 1860 

tacaataact gtatactgga tgtacatttt cccatggttg tctacaggaa gctaatgggg 1920 
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aaaaaaggaa 
gatttattgg 
cagacagatc 
ccaattacaa 
aaatcagtag 
tctcccttaa 
aatctagatt 
tctgttctga 
ctcttcttgc 
aagatgatta 
tttaatgtgc 
aaggccatca 



cttttcgtga 
agtatgaagg 
tttttggtaa 
atgaaaacag 
aaaaacagtt 
agtacttatt 
tccaagcact 
ttagggagtt 
agtttacaac 
tagccaaaaa 
tt ttacttcc 
cgtatgccaa 



cttgggagac 
gaatgtggaa 
cccaatgatg 
gaaggaattt 
caaggctttt 
cagaccagaa 
agaagaaact 
ctgggaaatc 
gggcacagac 
tggcccagac 
ggaatactca 
aggatttggc 



tctcacccag 
gatgacatga 
tatgatctaa 
gtcaatcttt 
cggagaggtt 
gaaattgaat 
acagaatatg 
gttcattcat 
agagcacctg 
acagaaaggt 
agcaaagaaa 
atgctgtaa 



ttctatatca 
tgatcacttt 
aggaaaatgg 
attctgacta 
ttcatatggt 
tgcttatatg 
acggtggcta 
ttacagatga 
tgggaggact 
tacctacatc 
aact taaaga 



gagtttaaaa 
ccagatatca 
tgataaaatt 
cattctcaat 
gaccaatgaa 
tggaagccgg 
taccagggac 
acagaaaaga 
aggaaaatta 
tcatacttgc 
gagattgttg 



<210> 29 

<211> 4263 

<212> DNA 

<213> Homo sapiens 



<400 
ggccgttccc 
ttaccttgag 
gaggacttgg 
gccacctcga 
ttccaagggg 
gtactgtgcc 
gtaggagtcg 
agcatgtcat 
gccgccacta 
tgagcatatg 
agattggaga 
attatgtcat 
ggagagaagt 
catttgaaga 
ttaaacgaat 
ttgtgcagat 
taaagtcaca 
ttgctcgagt 
acttgtcacc 
tgggctgtgt 
aaaacctggt 
atgatctccg 
tcaactccat 
agcttattgc 
caggtaaaag 
caaagcctgc 
tacttgagaa 
tgaattctgg 
aatttattga 
tgaagcggca 
ggaatgtttt 
gggctgtctc 
ttgaccaaat 
atggtcgatg 
tggaagaatt 
acgtggttta 
agattaaagc 
caactgaaga 
cacgtgctgc 
aaaagaaagt 
ctttggaact 
ctgtgacttc 
aagaaatgga 
atgaaaatct 
agaccgttgg 
gcaagaagtg 
catccttctc 
gctctccaag 



> 29 
ctctcctcag 
cctgtgcagt 
gcgacaggta 
cctgtgaagt 
tcaggaaccg 
tgacccgacg 
cagcctgggc 
ttccaaaatg 
gaccggtgct 
tatgaaaatg 
aggttcattt 
caaggaaatt 
tgctgtattg 
aaatggctct 
aaatgctcag 
atgtttggct 
gaacatattt 
tcttaatagt 
tgaaatctgt 
cctttatgag 
actgaagata 
cagcttgctg 
attggagaaa 
agaagaattt 
accagcatca 
tgctaaatac 
aaaaccaccc 
agaagaaagg 
gaaagaaaag 
agagaagcag 
aagggctggt 
tccatcaccg 
gcagcggcta 
gctcccagaa 
cctacagcgt 
tttggcaaga 
caaacttcgt 
gactgacatg 
tgtactaaaa 
atgggaagaa 
t cttgaaaca 
agctttgaaa 
aaagagtaac 
taaagctcaa 
tcacaaagat 
ggagatggga 
tgcaaccgaa 
aaaagtctgg 



cagtagctct 

tgccctcacc 

gcctcccagt 

tgggggcggt 

ctgtgtttgt 

gtggcaagtc 

tgtgagcggc 

catcgtggtg 

tatggtcgcc 

ccttaggagg 

ggaaaagctg 

aacatctcaa 

gcaaacatga 

ctctacatag 

aaaggcgctc 

ctgaagcatg 

ctaaccaaag 

actgtagagc 

gaaaacaagc 

ttgtgtacac 

atctccggat 

tctcagttat 

ggttttatag 

tgtctaaaaa 

ggacaaggtg 

ggagtgcctt 

ccaaaacata 

aagaaaatgt 

aagcaaaagg 

cggttggaga 

ggaagcggtg 

tgttctcctc 

agagcagaag 

aggcaaaaag 

aaacgagaag 

ctgaggcaaa 

ggtgagaata 

aggctcaaaa 

gaacagct gg 

catttggtgg 

ggtggttctc 

gaagtgggcc 

agtgctattt 

gaggatgaaa 

gagagagagt 

ggtcagcttg 

aaacatactg 

gggaaaaacc 



atggtttcag 

ccggaatcca 

cccacacgct 

acccagcaac 

t tcgtccgcg 

tgacgcgt ca 

tgctgggtag 

cttctgcctt 

actgttattc 

gaaccatgga 

ttcttgttaa 

gaatgtctga 

agcatccaaa 

taatggatta 

tgtttcaaga 

tacatgatag 

atgggacagt 

tggctcgaac 

cttataacaa 

ttaaacatgc 

cctttcctcc 

ttaaaagaaa 

ctaaacgaat 

cactttcaaa 

tcagttcttt 

taacatataa 

aacaggccca 

ctgaggaagc 

atcagattag 

ggataaatag 

aagtaaaggc 

gaggccagta 

ataatgaagc 

gacacttagc 

ctatgcagaa 

taagactaca 

aagaagctga 

agatggagtc 

agcgaaaaag 

cgagggtaaa 

catcaaagca 

tggatggaag 

caagtaagcg 

aggaaaagca 

atgagacaga 

tgattcctct 

tgggagaggt 

ctacagattc 



ggcggcaacg 

tagtcactgt 

gcgggtccgc 

tccccctgtg 

tagccagggc 

gccagagacc 

acagacttgc 

aagtcctata 

tgactcaggt 

gaagtatgtg 

atcgacagag 

taaagaaagg 

tattgtccaa 

ctgtgaagga 

agaccagatt 

aaaaat tct t 

gcagcttgga 

ttgcataggc 

taaaagtgac 

atttgaagct 

agtgtctcca 

tcctagggat 

cgaaaagttt 

gtttggacca 

tgtccctgct 

gaagtatgga 

tcaaattccc 

agcaaaaaaa 

gttcctgaag 

ggccagggaa 

ttcctttttt 

tgaacattac 

aagatggaag 

tgtagagaga 

taaagcccga 

aaattttaat 

tggtaccaaa 

acttaaggcg 

aaaggaagct 

aagctcagat 

gcaggtgaag 

tttaactgat 

agaaatcctg 

gcatcactca 

aaatgccatt 

cgatgcagtg 

tattaaatta 

tgtgctgaag 



tgcagcgtcc 
gacgaggcgg 
gcctggccaa 
cagccgccgt 
gggtcgcgga 
ggtgcccggt 
tttctcttac 
ggaagacact 
cccgtgtcat 
agactgcaga 
gatggcagac 
caagaatcaa 
tataaagaat 
ggtgatttgt 
ttggactggt 
caccgagaca 
gattttggaa 
actccatact 
atttgggctt 
ggaaacatga 
cattactcct 
agaccatcag 
ctctcccctc 
cagcctctcc 
cagaaaatca 
gataaaaagt 
gtgaagaaaa 
agaaggttgg 
gctgagcaga 
caaggatgga 
ggcattggag 
catgccattt 
gggggaatct 
gccaaccaag 
gccgaaggac 
gagcgccaac 
ggacaagaag 
caaacaaatg 
tatgaaagag 
gttcctctgc 
cctgtcatt t 
acccaggaag 
cgtaggctaa 
ggttcttgtg 
tcctctgatc 
acactggata 
gattctaatg 
atacttggag 



1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2619 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
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aagctgaatt 

atgctgaaga 

caaaagaaat 

ctgaggtgtc 

aagttctaca 

atgtgtggac 

tgcagcagag 

atcagagaga 

tacagccaga 

cagttcattg 

aaactaagag 

atccaaagat 

tggacgttcc 

atgaaccaat 

cagatttaca 

aatacagtga 

ggacagatgc 

gcactcagat 

ccatttagag 

ggtttatgag 

aacaatagtt 

tttagtcatg 

tttaatagtc 

ata 



acagctatag 

ggagaactac 

aaatccatca 

tccacaaatg 

agagccaagt 

tagagagaag 

tgaagtttgt 

tcctgcagta 

atcgattttc 

ctcaccagaa 

caagaattcc 

gctgaggacc 

cactgtgggg 

taaagaaggg 

agagcttcag 

ggaggaagag 

cccagacgag 

aatagtgacg 

gaactaagac 

aaagtaaagg 

gagaatattt 

gcagatggag 

aactgtaaga 



acagaactac 

aaacccttac 

gctactgttg 

tcagaaggaa 

agcacacaca 

gaagcagcta 

gaagatagaa 

gacgattctc 

cagaaagtgg 

gaaccaattc 

ttactgattg 

tgctcacttc 

gacgttcatc 

ccttctgatt 

gcctcaatgg 

tctgttttaa 

gaggacaacc 

ctgagaccac 

ttcacttgga 

ctattcatga 

tgggcaatga 

cctatcagga 

acacatttga 



tagaaaacac 

ttactgaaga 

attctactga 

atgtggaaga 

cagatgggag 

aggaaactga 

ttccagggaa 

cgcagtctgg 

ttcattctaa 

caattcgatc 

gactttcaac 

cagatctttc 

aagacagtct 

ccgaagacac 

agcagctgct 

aaagcagcga 

ccagcagcga 

tagtgaatgt 

gcaagaaatg 

ggatgaagat 

gcaccagcat 

agataatgat 

acttggctca 



atcttttaaa 

agagaatctg 

aacgaaaagt 

acctgatgat 

tttgccacct 

gttggaagat 

cgtggaccaa 

ctgtgatgta 

ggacttgaac 

tcactctgat 

tggtctgttt 

caagctgttc 

tgaaatcgat 

tgtatttgaa 

tagggagcaa 

tgtggagcag 

aagcccctga 

gaatatgaca 

ggctttgaaa 

gaaaatattg 

ctctatgcca 

gaataatcct 

taatacaagc 



agtgaggttt 

cagtgcattt 

ccaaagttta 

ttggaaacag 

gttcttaatg 

aaggttgctg 

tcctgtaagg 

gagaagtcag 

ttagttcagg 

tctccaccaa 

gatgcaaaca 

agaaccctaa 

gagctggaag 

gaaactgaca 

ccaggtgacg 

acagcaagag 

acgaggaatg 

gtgtctttaa 

agttctttga 

aaatttgttc 

agattctgca 

caggacattc 

ttcctgggaa 



<210> 30 
<211> 1756 
<212> DNA 

<213> Homo sapiens 



<400 
tcgggcgcag 
gtgcccgtgg 
cccaccgtca 
ctgactcgct 
cgccgggagg 
gccatgaaag 
gcgaaagaag 
cacatgatcc 
tgcgacgacc 
tacaaagact 
gaagccagct 
gtgattgacg 
atcgtgt act 
ggcgagaacg 
ggcaagaagt 
cgcgtgtgga 
tacgagcagt 
gaggagatca 
taccgctacc 
atggagctgg 
ctgcttgcct 
cacaccgtcc 
aacgtggagt 
ccgctcatga 
cgcatcaaca 
cccccggagg 
gactcctcca 
cttagcttgt 
agggtggggt 
tgtcccctcg 



> 30 
ccgcgaagat 
accacaggcc 
tcgtcatggt 
acctcaactg 
ctgtgaagca 
tccggaagca 
ggggacaaat 
ttcattttgc 
ctacagttgt 
gcaactcggc 
accagcccct 
tgggccggag 
acctgatgaa 
agcacaacct 
ttgccagtgc 
ccagccagct 
ggaaggcgct 
gggacaccta 
ccaccgggga 
agcggcagga 
acttcctgga 
tgaaactgac 
ccgtctgcac 
gacgcaatag 
gctttgagga 
tgcccacgca 
ggaaacactg 
gtcctgccct 
ggagcagcgg 
gccgct 



gccgttggaa 
ctcgttgccc 
gggcctcccc 
gattggcgtc 
gtacagctcc 
atgtgcctta 
tgcggttttc 
caaagaaaat 
ggcctccaat 
agaagccatg 
cgaccccgac 
gttcctggtg 
catccacgtg 
ccagggccgc 
tctgagcaag 
gaagagcacc 
caatgagatc 
ccctgaggag 
gtcctaccag 
gaatgtgctg 
taagagtgca 
gcctgtcgct 
acaccgggag 
tgtcaccccg 
gcatgtggcc 
gctgcctgga 
aggcagacgt 
ccgcccgagg 
gggagccttg 



ctgacgcaga 

agatcctgtg 

gcccggggca 

cccacaaaag 

tacaact tct 

gctgccttga 

gatgccacca 

gactttaaag 

atcatggaag 

gacgacttca 

aaatgcgaca 

aaccgggtgc 

cagccgcgta 

atcgggggcg 

ttcgtggagg 

atccagacgg 

gacgcgggcg 

tatgcgctgc 

gacctggtcc 

gtcatctgcc 

gaggagatgc 

tatggctgcc 

aggtcagagg 

ctagccagcc 

tccacctcgg 

caaaacatga 

gtcggttcca 

caaaacgtat 

gccgaagaga 



gccgagtgca 
ggccaaagct 
agacctacat 
tgttcaacgt 
tccgccccga 
gagatgtcaa 
atactact ag 
cgtttttcat 
ttaaaatctc 
tgaagaggat 
gggacttgtc 
aggaccacat 
ccatctacct 
actcaggcct 
agcagaacct 
ccgaggcgct 
tctgtgagga 
gggagcagga 
agcgct tgga 
accaggccgt 
cctacctgaa 
gtgtggaatc 
atgcaaagaa 
ccgaacccac 
ccgccctgcc 
aaggctcccg 
ttccatttcc 
cctgaggact 
accatgcttg 



gaagatctgg 
gaccaactcc 
ctccaagaag 
cggggagtat 
caatgaggaa 
aagctacctg 
agagaggaga 
cgagtcggtg 
cagcccggat 
cagttgctat 
gctgatcaag 
ccagagccgc 
gtgccggcac 
gtccagccgg 
gaaggacctg 
gcggctgccc 
gctgacctac 
caagtactat 
gccagtgatc 
cctgcgctgc 
atgccctctt 
catctacctg 
gggacctaac 
caaaaagcct 
cagctgcctg 
gagcagcgct 
atttctgcag 
tct tccggag 
gcaccgtctg 



2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4263 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1756 



<210> 31 
<211> 1661 
<212> DNA 

<213> Homo sapiens 
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<400 
tgctgcagcc 
acgtgcattc 
gatttataat 
tggtgctgtt 
tgctcttgtt 
cagccttttc 
ctccaagctg 
tttggtccag 
cccgaactgc 
gaagataagt 
ccgattccgt 
ggtccttggg 
tgtctctctg 
agaggttcac 
atcctgggct 
gcgggtgcac 
ccttagtgtt 
gacttctgag 
ggagctgcaa 
attctaggtg 
attttaagat 
aaatccacag 
agtgtgaaat 
gccccttgag 
acatgcctag 
caatatcatg 
gtgtacatta 
gtgttatacc 



> 31 
gctgccgccg 
ccgattcctt 
cttctaaagg 
ggcatggcct 
gatgtcatcg 
cttagaacac 
gtcattatca 
cgtaacgtga 
aagttgctta 
ggttttccca 
tacctgatgg 
gaacatggag 
aagactctgc 
aagcaggtgg 
attggactct 
ccagtttcca 
ccttgcattt 
gaagaggccc 
ttttaaagtc 
gaggttgtgc 
ggactgggaa 
ctatatcctg 
agttctgcca 
ccaggtggat 
tccaacattt 
tcttgtgcat 
ccatataatg 
aactaaaacc 



at tccggatc 

ttggttccaa 

aagaacagac 

gtgccatcag 

aagacaaatt 

caaagattgt 

cggctggggc 

acatatttaa 

ttgtttcaaa 

aaaaccgtgt 

gggaaaggct 

attccagtgt 

acccagattt 

ttgagagtgc 

ctgtagcaga 

ccatgattaa 

tgggacagaa 

gtttgaagaa 

ttctgatgtc 

atgttgtcct 

aaacatcaac 

atgctggatg 

cctctgacgc 

gtttaccgtg 

tttcccagtg 

aattcttcca 

taaaaagatc 

cccaataaac 



tcattgccac 

gtccaatatg 

cccccagaat 

tatcttaatg 

gaagggagag 

ctctggcaaa 

acgtcagcaa 

attcatcatt 

tccagtggat 

tattggaagt 

gggagttcac 

gcctgtatgg 

agggactgat 

ttatgaggtg 

tttggcagag 

gggtctttac 

tggaatctca 

gagtgcagat 

atatcatttc 

ttttatctga 

tcctgaagtt 

gtattaatct 

accactgcca 

tgttatataa 

agtcacatcc 

aaggatct ta 

tacatacaaa 

cttgaacagt 



gcgcccccga 

gcaactctaa 

aagat tacag 

aaggacttgg 

atgatggatc 

gactataatg 

gagggagaaa 

cctaatgttg 

atcttgacct 

ggt tgcaatc 

ccattaagct 

agtggaatga 

aaagataagg 

at caaactca 

agtataatga 

ggaa taaagg 

gaccttgtga 

acactt tggg 

actgtctagg 

tctgtgatta 

agaaataaga 

tgtgtagtct 

atgctgtacg 

cttcctggct 

tgggatccag 

ttt t gtgaac 

caatgcaacc 

g 



cgaccgcccg 

aggatcagct 

ttgttggggt 

cagatgaact 

tccaacatgg 

taactgcaaa 

gccgtcttaa 

taaaatacag 

acgtggcttg 

tggattcagc 

gtcatgggtg 

atgttgctgg . 

aacagtggaa 

aaggctacac 

agaatcttag 

atgatgtctt 

aggtgactct 

ggatccaaaa 

ctacaacagg 

aagcagtaat 

atggtttgta 

tcaactggtt 

tactgcattt 

cct tcactga 

tgtataaatc 

tatatcagta 

aactatccaa 



<210> 32 

<211> 4169 

<212> DNA 

<213> Homo sapiens 



<400 
ggcggcttcc 
gctgctgcta 
cgacgagtct 
agacatggcg 
tccagagcac 
tgaaagcata 
aggaagccct 
caatgccaac 
tcgcgatggg 
tgatgtaact 
taaaacaaag 
caaggaagaa 
gctgtggggg 
ctcaggcaac 
ccagtttgtg 
taattaatac 
tgactgtgtt 
tcccctccaa 
cagagaagac 
tttcattatg 
agggtgtgta 
aatcctaagt 
ggtgtctgtc 
aggaaagagt 
ggatttaagt 
ttgtttctaa 
atgatgtgct 
agttagaaca 
tcaattgtga 
aaagaatggt 
gtggaagctg 



> 32 
aggtgggcgc 
ctgctaccac 
accccacctc 
cgtcttctgg 
aagagacctt 
ttgaaaatga 
actgagaagg 
tatgacgtcc 
agctacgcct 
ctggagggcc 
caagacaagg 
aatcgagctg 
gacaggtgga 
tgcacaccgg 
gtcaaagaat 
tggacactga 
gctttttccc 
gcacctgctg 
tttaaaactc 
ttaaaggtaa 
aatccaaaat 
agttttgaat 
aggttttatg 
actgctggtg 
agggaatcag 
acatttttct 
actgttgatg 
tttatataca 
ttttaataaa 
acaaagtgaa 
ctgttaacaa 



gcaaggccgt 
cgcctgggtc 
cccggaagaa 
agcaatggga 
cagcacctgt 
cgaaaaaagg 
agacagagga 
agaggttcat 
gggagatcaa 
aggtgtaccc 
gcaaaaaaaa 
ggaataaaag 
cgtggagagc 
atgacattct 
tactgcttaa 
caaattcatg 
atttacactt 
tgattgggtg 
ttgacttaat 
aatatgcttt 
ctatgcagga 
ttctttgact 
tggtctgtaa 
tcgaggggtt 
agtcctttgc 
ttgataaagt 
tctgattttg 
gatgtcattt 
ttaaaagtag 
aagtgtatcc 
tttgtgcccc 



ggtcctgctt 
ctgcgcggcc 
gaagaaggat 
gaaagatgat 
cgacttctca 
gaagactctc 
aattacgagc 
tgtgggatca 
ggactttttg 
cggcaaagga 
gaaggaagga 
agaagacctg 
tctttgccca 
agtgtcttct 
taggcttcaa 
tttactataa 
ggtgagtcat 
gcctgctctg 
tgagtaaact 
gtgaactcag 
aatgcggaga 
tgaatcttac 
agttaggggt 
atatgttcca 
agagtgtgac 
gcctaaatct 
ccgt tcatgt 
ctaagaacta 
cacattacat 
ctctcttgcc 
caca tccccc 



tgtgcctctg 
gaaggctcgc 
attcgcgatt 
gacat tgaag 
aagatagacc 
atgatgtttg 
ctctggcagg 
gaccgtgcta 
gtcggtcaag 
ggaggaagca 
gatctgaaat 
tgatggggca 
gctcctgggg 
agaaagggtc 
gtaagaagac 
aatctcctta 
caactctact 
atcagatagc 
cttcatgcca 
atgtctgtag 
atagaaaata 
tcatcagtaa 
tctgttttgt 
tttaatgtga 
agacgactca 
gtgct ttcgt 
tagagcctac 
aaattctttg 
ggttagaaaa 
gccggtggta 
tccctgccca 



acctgctgct 
ccgggacgcc 
acaatgatgc 
aaggagatct 
caagcaagcc 
tcactgtatc 
gcagcctttt 
tctt catgct 
acaggtgtgc 
aagagaaaaa 
ctcggtcttc 
gcagtgacgc 
tgggagtggt 
tgccacatga 
agatgttttc 
catggaaatg 
gagattccac 
aaattctgat 
tatacatcat 
ccaggaagcc 
tgtcacttga 
gagaactctt 
ttccttattt 
cagttttaaa 
ataacctcat 
atagagtaac 
tgtgaataag 
ggaaaaaccc 
tgtcagtgtt 
gcttgtccca 
tccaccaaaa 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1661 
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aaaagtacat 
gcatctgttt 
tgcattctca 
gccatgccct 
aagaggcttt 
gcccttggag 
acgagtccct 
actcagggca 
gctgttttga 
ctgtaccttc 
tgtttcctgt 
aggtactaca 
ttcggtgggg 
tgaaaggggc 
ccttctctga 
ctgagctaag 
gaggctgggg 
gtttgggacg 
tgcactggtt 
tgggaggtgc 
gtgaggtttg 
attttttgct 
cttttgttta 
taacattttg 
ccaacaatag 
aaaaccatat 
ttcacgtaga 
ttcccagccc 
gcaatcctag 
gtctgcagat 
cccagtactt 
ctggccaaca 
ctgtaatccc 
ggttgcagtg 
gtctcaaaaa 
acagggtctc 
tctacctacc 
cctgttttta 
ataaatttgg 



ttacttatgt 
ccccttaatt 
gttctttgct 
ctccagggtg 
tcttgttcct 
aagagaactg 
gaatcatgcc 
gtttgatggg 
tgagataatt 
tgtttccact 
gcagtgtgcc 
aggtgccagg 
gaagacaaat 
agggtagggg 
gggcaagagg 
accctgtggc 
cccagtgcag 
ccctggtgga 
gcacaggaga 
ttcagtagtg 
aaccactgtt 
ggtatattca 
tatctgaaag 
gtgtgtattg 
atctatcctt 
cttaattggc 
gagagggata 
aggcacagac 
ttgacactga 
tcatgagcaa 
tgggaggccg 
tggtgaagcc 
agctactcag 
agccaagatc 
aacgaaacaa 
actctgttgc 
ggctcaattg 
atgaatttgt 
atgatatctt 



aaatgtactt 
tggtagctgc 
ttggatggga 
aggcctgtgt 
cctgcatctt 
tctgacagtg 
gtggctgaac 
gttactgtcc 
gctttaatta 
taggagcctt 
cccacttcac 
acggtgtaga 
gtcagtaatg 
gaggtagaat 
aggagaggag 
tagaagtggc 
gcccaggttg 
tcttaattgt 
agtgatcaga 
caggccatag 
tcactgcata 
gtgctgcagt 
gtaaattcct 
ccaaattgcc 
gccagccttg 
taccctgaag 
cccaggaggc 
atgtggataa 
ggggagccaa 
aagaaatgtt 
aggtgggtgg 
ctgtctctac 
gtggctgagg 
aggccactgc 
aaatttttta 
cctcactaga 
atcttcccac 
cttccttttt 
ttgactatc 



atggtgatgt 
tcacatttcc 
catttgccct 
gatctaccgt 
gaacctggag 
gggagagagc 
caagccctgt 
tgcatagcca 
agcaaaaggt 
cccatgtcag 
ccagtagttt 
tacagcctct 
tacaaagtaa 
cttccctgac 
aaatagagta 
agagggagag 
gaggagcgta 
gatggggtgg 
agaggacccc 
ctgaaggtgt 
gtccctgctg 
ggacattttc 
agcagtagaa 
ct cccaagtg 
ggcatcacat 
ccaccatact 
ccacctgctc 
gatttaaaca 
cataagctga 
gggctgggta 
atcagttgag 
taaaaattag 
caggagaatg 
actccagcct 
agagaaatgt 
gtgcagtagg 
cacagcctcc 
gtcttgtttg 



atgtttgttt 
ctcgaaagaa 
gcagtccccc 
actagggtac 
cgggagctgt 
gccacaccct 
ctgtgggctt 
taatggccca 
agcaaagctt 
aatgtgcaga 
ctgtgtgtct 
gctatcgtaa 
aatggcagct 
caggttaaga 
aggcaggcag 
gcagcaggaa 
gcacatggag 
gtgtgaaagg 
agcaggtgtg 
cctacatcag 
atggacactt 
atacaaaata 
ttattagagc 
gtt tagtcag 
ttaccagt t t 
ggagaggctg 
caaccccagc 
ttt ccagccc 
gctgagaaac 
cagtggctca 
gtcaggagtt 
ccgagtgtgg 
gcttgaaccc 
ggatgacggg 
catttgtttt 
gatcacggct 
caaatagctg 
ttttaatatc 



tggcctcaca 
ccacaccctc 
caccctccag 
taggccctga 
tgtaggcccc 
ggtggcataa 
tttctgttgt 
gtataaagca 
tcactccgcc 
tctgtctcat 
gttatgtact 
aactcaatga 
gttagaagta 
aaaccagagg 
aggaaacagt 
ggccagcggg 
tttggtagga 
cagtccaggt 
agccgtgagc 
cagggtgatg 
gagtgttcag 
tttcggtaca 
aaacggaatt 
cttacccttg 
aatagattgt 
cgtacagtgt 
tgcatgagtc 
cagccttcaa 
agtctgccca 
cgcctgtaat 
tgagaccagc 
tggtgcgggc 
gggaggcgga 
atgagactct 
tgtttttgag 
cactgaagtc 
ggagaaatgt 
tagtgatcta 



<210> 33 

<211> 859 

<212> DNA 

<213> Homo sapiens 



<400> 
catgccatgc 
cgactggttg 
tgccaggaag 
ctggaatctc 
gaggaaatta 
gagcacaaac 
ctaaacacaa 
tgtttgcgag 
aaggagacct 
gtcatgttct 
gaaatccagg 
gggggcagtg 
gagcaacaga 
agctaaatgt 
ccttagagat 



33 

agcttaccaa 
caagtaaaac 
agaaaataga 
agcgaatcta 
acaacatgaa 
taaatgatct 
aagtggccaa 
ccaaccaagt 
gtgaccaaaa 
acctggagac 
agggacagat 
ggaagttgcc 
catccctgag 
gagggtgggc 
ggatgaggc 



ccatcgagtc 
agatggaaaa 
tgccttacag 
ctgggaaaac 
gaccaagttt 
cctaaaagaa 
actcaccaac 
cctcctgcag 
agatctgcag 
acagcagaag 
caacatcgcc 
ctccaggaag 
actgttctcc 
cctaataagt 



tgggactatg 

atagtacagt 
ttagagtatt 
aagatagttc 
aaagaaacaa 
aagcagtctg 
gagctcaaag 
aacaagctaa 
atcaccgaga 
atcaaccatc 
atggcctcgg 
ggccgcagca 
ctgacactgt 
acaagtgagg 



ctggagataa 
atgaatgtga 
catatttact 
ggatagagaa 
ttgagaagtg 
tggaaagaaa 
aggagcagga 
aagaggagga 
tccaggagca 
tgcctgccga 
cctcgagccc 
agaggggcaa 
gagagtgtgc 
acgaaggccg 



ctatgttcat 

gggggatact 

aacaagccag 
ggacacagca 
tgataatcta 
gtgcactcag 
aatgaacaag 
gagggtgctg 
gctgcgtgac 
gacccggcag 
tgcct cttcg 
gtgacct tea 
tgggaccttc 
gect t cgtgg 



1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4169 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
859 



<210> 34 

<211> 1070 

<212> DNA 

<213> Homo sapiens 
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<400> 34 

gcgattgctg gggctgcagc gotgcctccg 
ggaagggtgc gataaggccg ggcgaggtgc 
agatctaatt gggtagggcg ggtgtagact 
gcctcctggg cgcccgcggg cccggcgaga 
ctggggtccg aggcgcgcct ctcacgccct 
tggagacgca gtttccttgt taatgacaat 
acgcagcgaa atcgagcctg gccttgaggg 
ttccagagcc taagcctgga gacctgattg 
ccatctatgt tggcgatgga tatgtggttc 
ctggtgcagc cagtgtcatg tccgccctga 
tgtatgatgt ggccgggagt gacaagtacc 
cgccgctgcc ctgcacgaaa atcatccagc 
tctacaagct gaccagtgag aactgcgagc 
cccgcagtga ccaggtcaga gatgtcatca 
cagccatgag ccttattgga gtcatgttct 
aaaagactgt ctgtcagcga tgactttata 
ttggggtttg gtttgtggat ttcattgtga 
aataaagcaa aacgagggag gattttattg 

<210> 35 

<211> 460 

<212> DNA 

<213> Homo sapiens 



agaccgagag 

ctgggatgct 

agcctgccga 

aagttgttaa 

gcccaacaga 

aaatccctgc 

tccacaccgc 

agatttttcg 

atctggcccc 

ctgacaaggc 

aggtcaacaa 

gggcggagga 

actttgtgaa 

tcgctgcaag 

caagaaacaa 

ca.tcaagggg 

tttataataa 

ggggagtgca 



tgggtggagc 

tctccccttc 

gccgcccgct 

agggagcgag 

agccgcagtc 

tccccctgcc 

gaggaagatg 

ccct ttctac 

tccaagtgag 

catcgtgaag 

caaacatgat 

gctggtgggg 

tgagctgcgc 

cgttgcagga 

gcgacaaaag 

gtcttgtttt 

ggcttatttt 

gcccaaaaaa 



gggtcttcct 
cgcgaggaag 
ggcacctgca 
gtggttgttc 
ccgtggggtc 
tcagacatct 
cgtgcgccca 
agacactggg 
gtcgcaggag 
aaggaattgc 
gacaagtact 
caggaggtgc 
tatggagtcg 
atgggcttgg 
caataactga 
gctagagagt 
cacagaataa 



cttt 
caga 
cctc 
acga 
ccca 
cccc 
atga 
aaat 



<400 
tcctcc 
cgccgc 
ccccag 
ccggct 
gggtat 
aggctc 
gaagaa 
tcctgc 



> 35 
catgtcgcca 
cgcgatgcgc 
cgccaaggac 
caacaaggtt 
tggcaagctt 
tgcagcccct 
ggaggagtct 
tcccctgcaa 



ccgaggtgcc 
tacgtcgcct 
atcaagaaga 
atcagtgagc 
gccagtgtac 
gctgctggtt 
gaagagtcag 
ataaagcct t 



acgcgtgaga 
cctacctgct 
tcttggacag 
tgaatggaaa 
ctgctggtgg 
ctgcccctgc 
atgatgacat 
t ttacacatc 



cttctccgcc 
ggctgcccta 
cgtgggtatc 
aaacattgaa 
ggctgtagcc 
tgcagcagag 
gggatttggc 



<210> 36 

<211> 4416 

<212> DNA 

<213> Homo sapiens 



<400 
gccgcacagg 
atattttatt 
tt tcatgaaa 
tattcctacc 
ccagaatgga 
agatgctttg 
ttcaagtagt 
gcaggtcatc 
agttactaaa 
gaagaagtct 
atcaaaaagg 
gaagaagaaa 
tattcatgca 
taagagagag 
gactaaaaaa 
tcttctgtcc 
gctaaaggag 
gactacgctt 
tgctgctgct 
aagaagcaat 
taatgtattt 
gtttcagcaa 
tatactgaga 
actaaataaa 
aaagctacag 
ggttcaacta 



> 36 
gttttatagg 
gttcttattc 
gaaacattat 
aaaacagata 
aacctccatg 
gcagtagaag 
gttagggaaa 
aaagaaagtg 
cagcccaccc 
aaaaatggaa 
caagaagcat 
gcttcatcaa 
actacttata 
gttattgatt 
ctgaagaccg 
ttgaagacta 
aagtctggtg 
atacatcagc 
acaaaggatc 
gtggttataa 
caaaacaaaa 
gttcgtgagc 
gatgcagtca 
ctacgccagg 
caagaggaag 
caagaagctg 



atcacattga 
cttcaatagt 
atgatgaagt 
aaaagaaagc 
aatccgactc 
atgatcaagt 
gaaaaaagaa 
acgcatcaaa 
ctccctctga 
gcgatgacca 
tgcccctcca 
agaaacaaaa 
ttcctttgat 
tgcttaaacc 
aaactgacaa 
tgatgttttc 
taatacaaga 
ttcaagaaaa 
ggtgtaagca 
caaggatgaa 
tacatgtcag 
agatggaggc 
gcaacactac 
attatgctag 
tccaaaagaa 
agagaaggtg 



caaaagtacc 
tattacagta 
tcttgcaaaa 
agaaaagaaa 
tgagagtgta 
tgcacctgtt 
ggaaaagaaa 
gattcctggc 
agcagctgcc 
ggataaaaag 
ccaagagact 
gacagaaaat 
ggataatgct 
tgaccaagta 
agaaaatgct 
tgaagatgag 
tgctttaaag 
ggacaagtta 
gttaacccag 
agatcgaatt 
ttatcaagag 
agagatagct 
aaatcaactg 
gttggtgaat 
gaatgctgag 
ggaagaagtt 



atggagtttt 
attttcctct 
cagaaaagag 
aagaataaaa 
cctcgagact 
ccattgaatg 
caaaagcctg 
aaaaaagtag 
tcgaagaaga 
gtggaaactc 
aaacaagaaa 
gtcttcgtag 
gactcaagtc 
gaagggatcc 
gaagtgaagt 
gctctttgtg 
aagtcaagta 
ctcgctgctg 
gaaatgatga 
ggaacattag 
actcaacaga 
cacttgaagc 
gaaagcaagc 
gagctgactg 
caagcagcta 
cagagctaca 



gcctccgccg 

gggggcaact 

gaggcggacg 
gacgt cattg 
gtctctgctg 
gagaagaaag 
ctttttgatt 



atgagtcagc 
tcttctggct 
aacaaaagct 
agaaagaaat 
ttaaattatc 
tcgttgaaac 
tgcttgaaga 
aacctgtccc 
aaccagggca 
tcatggtacc 
gtggatcagg 
atgaacccct 
ctgtggtaga 
agaaatctgg 
ttaaagattt 
ttgtagactt 

agggagaatt 

tgaaggaaga 
cagagaaaga 
aaaaggaaca 
tgcagatgaa 
aggaaaatgg 
agtctgcaga 
agaaaacagg 
ctcagttgaa 
tcaggaagag 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1070 



60 
120 
180 
240 
300 
360 
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fo™! 9aa ca ^aggcag cacagcaaga tttacagagt aaatttgtgg ccaaagaaaa 162C 

tgaagtacag agtctgcata gtaagcttac agataccttg gtatcaaaac aacaattaaa 1680 

gcaaagacta atgcagttaa tggaatcaga gcagaaaagg gtgaacaaag ga J ?c "5S 

gttcc^tcc SL ta ""ggagca gaatgaggct ttgaaagctc aaatLagca 18o2 

aataat^aca S cc "gacctc cgcttcagtt ctagcagaag aattacataa 1860 

tnlt^lf^t gaaaa ^ ata agcagataaa acagactgaa gattctttag caagtgaacg 1920 

tgatcgttta acaagtaaag aagaggaact taaggatata cagaatatga atttc?tatt 1980 

aaaagctgaa gtgcagaaat tacaggccct ggcaaatgag caggctgctg ctgcacatga 2040 

attggagaag atgcaacaaa gtgtttatgt taaagatgat aaaataagat tgctggaaga 2100 

caaaacat^a aSS""' ^ aacaaaat ^aagaattt aagattcfaa atgaccaaC 216^ 

caaagcatta aaatcagaag ttcagaagct acagactctt gtttctgaac agcctaataa 2220 

ggatgttgtg gaacaaatgg aaaaatgcat tcaagaaaaa gatgagaagt taaagactgt 2280 

ggaagaatta cttgaaactg gacttattca ggtggcaact aaagaagagg agctgaatgc 2340 

aataagaaca gaaaattcat ctctgacaaa agaagttcaa gacttaaaag ctaagcaala 2400 

tgatcaggtt tcttttgcct ctctagttga agaacttaag aaagtgatcc atgagaaaga 2A60 

tggaaagatc aagtctgtag aagagcttct ggaggcagaa cttctcaaag ttgctaacaa 2520 

ggagaaaact gttcaggatt tgaaacagga aataaaggct ctaaaagaag aaataggaaa 2580 

tgtccagctt gaaaaggctc aacagttatc tatcacttcc aaagttcagg agcttcagaa 2640 

cttattaaaa ggaaaagagg aacagatgaa taccatgaag gctgttttgg aagagaaaga 2700 

gaaagaccta gccaatacag ggaagtggtt acaggatctt caagaagaaa atgaatcttt 2760 

aaaagcacat gttcaggaag tagcacaaca taacttgaaa gaggcctctt ctgcatcaca 2820 

gtttgaagaa cttgagattg tgttgaaaga aaaggaaaat gaattgaaga ggttagaagc 2880 

catgctaaaa gagagggaga gtgatctttc tagcaaaaca cagctgttac aggatgtaca 2940 

agatgaaaac aaattgttta agtcccaaat tgagcagctt aaacaacaaa actaccaaca 3000 

ggcatcttct tttccccctc atgaagaatt attaaaagta atttcagaaa gagagaaaga 3060 

aataagtggt ctctggaatg agttagattc tttgaaggat gcagttgaac accagaggaa 3120 

gaaaaacaat gaaaggcagc aacaggtgga agctgttgag ttggaggcta aagaagttct 3180 

caaaaaatta tttccaaagg tgtctgtccc ttctaatttg agttatggtg aatggttgca 3240 

tggatttgaa aaaaaggcaa aagaatgtat ggctggaact tcagggtcag aggaggttaa 3300 

ggttctagag cacaagttga aagaagctga tgaaatgcac acattgttac agctagagtg 3360 

tgaaaaatac aaatccgtcc ttgcagaaac agaaggaatt ttacagaagc tacagagaag 3420 

tgttgagcaa gaagaaaata aatggaaagt taaggtcgat gaatcacaca agactattaa 3480 

acagatgcag tcatcattta catcttcaga acaagagcta gagcgattaa gaagcgaaaa 3540 

taaggatatt gaaaatctga gaagagaacg agaacatttg gaaatggaac tagaaaaggc 3600 

agagatggaa cgatctacct atgttacaga agtcagagag ttgaaggcac agttaaatga 3660 

aacactcaca aaacttagaa ctgaacaaaa tgaaagacag aaggtagctg gtgatttgca 3720 

taaggctcaa cagtcactgg agcttatcca gtcaaaaata gtaaaagctg ctggagacac 3780 

tactgttatt gaaaatagtg atgtttcccc agaaacggag tcttctgaga aggagacaat 3840 

gtctgtaagt ctaaatcaga ctgtaacaca gttacagcag ttgcttcagg cggtaaacca 3900 

acagctcaca aaggagaaag agcactacca ggtgttagag tgaagtaatt gggaaactgt 3960 

tcatttgagg ataaaaaagg cattgtatta tattttgcca aattaaagcc ttatttatgt 4020 

tttcaccctt tctactttgt cagaaacact gaacagagtt ttgtcttttc taatccttgt 4080 

tagactactg atttaaagaa ggaaaaaaaa aagccaactc tgtagacacc ttcagagttt 4140 

agttttataa taaaaactgt ttgaataatt agacctttac attcctgaag ataaacatgt 4200 

aatcttttat cttattttgc tcaataaaat tgttcagaag atcaaagtgg taaagacaat 4260 

gtaaaattta acattttaat actgatgttg tacactgttt tacttaacat tttgggaagt 4320 

aactgcctct gacttcaact caagaaaaca cttttttgtt gctaatgtaa tcggtttttg ™" 
taatggcgtc agcaaataaa aggatgctta ttattc 



50 <210> 37 

<211> 628 
<212> DNA 
<21 3> Homo sapiens 

55 <220> 

<221> unsure 

<222> 284 . . 284 

<223> n = a, c, g or t 



60 <220> 

<221> unsure 



<222> 594. .594 

<223> n = a, c, g or t 



4380 
4416 



65 <220> 

<221> unsure 
<222> 597. .597 
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<223> n = a, c, g or t 
<220> 

<221> unsure 

<222> 607. . 607 

<223> n = a, c, g or t 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 




<400> 37 

ggcacgagaa gaagtctaaa aatggaagcg atgaccaggg ataaaaaggt ggaaactctc 60 
atggtaccat caaaaaggca agaagcattg cccctccacc aagagactaa acaagaaagt 120 
ggatcaggga agaagaaagc ttcatcaaag aaacaaaaga cagaaaatgt cttcgtagat 180 
gaacccctta ttcatgcaac tacttatatt cctttgatgg ataatgctga ctcaagtcct 240 
gtggtagata agagagaggt tatttatttg cttaaacctg accnagtaga agggatccag 300 

360 
420 
480 

gaaggaagat gctgctgcta caaaggatcc gtgtaagcag ttaccccagg aatnatncca 600 
aagaaanaca gagcaatttg gtatacaa 628 

<210> 38 
<211> 730 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> unsure 

<222> 127 . . 127 

<223> n = a, c, g or t 

<220> 

<221> unsure 

<222> 260. . 260 

<223> n = a, c, g or t 

<220> 

<221> unsure 

<222> 307 . . 307 

<223> n = a, c, g or t 

<220> 

<221> unsure 

<222> 329. . 329 

<223> n ~ a, c, g or t 

<220> 

<221> unsure 

<222> 336. .336 

<223> n — a, c, g or t 

<220> 

<221> unsure 

<222> 473. . 473 

<223> n - a, c, g or t 

<220> 

<221> unsure 

<222> 487 . . 487 

<223> n = a, c, g or t 

<220> 

<221> unsure 

<222> 494 494 

<223> n = a, c, g or t 

<220> 

<221> unsure 
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10 



15 



20 



25 



30 



35 



<222> 563. .563 

<223> n = a, c, g or t 

<220> 

<221> unsure 

<222> 640. . 640 

<223> n = a, c, g or t 

<220> 

<22 1> unsure 

<222> 681. . 681 

<223> n = a, c, g or t 

<220> 

<221> unsure 

<222> 689. . 689 

<223> n = a, c, g or t 

<220> 

<221> unsure 

<222> 708 . . 708 

<223> n = a, c, g or t 

<220> 

<221> unsure 

<222> 714 . .714 

<223> n = a, c, g or t 

<220> 

<221> unsure 

<222> 728 . . 728 

<223> n = a, c, g or t 

<220> 

<221> unsure 

<222> 730. .730 

<223> n = a, c, g or t 



40 



45 



50 



55 



60 



65 



<400 
aaagaattcg 
agattyyctt 
agactyncta 
agaattgact 
ggaagatgct 
gaaaganaga 
ggracataat 
gatgaagttt 
aaatggnata 
ctgcagaact 
aaacaggaaa 
ttgaaggttc 
atcagggngn 



> 38 
gcacgagtga 
ctgtccttga 
aaggagaagt 
acgcttatac 
gctgcyacan 
agcaatgtgg 
gtatyycaaa 
cagcaagttc 
ctgngagaat 
aaataaacta 
get acagcaa 
caactacacg 



agaccgaaac 
agactatgat 
ctggtgtaat 
atcagcttca 
aggateggtg 
ttataacang 
acaaaataca 
gtgagcagat 
gcagtcagca 
cgncaggatt 
gaggaagtca 
nacgectgng 



tgacaaagaa 
gttttctgaa 
acaagatget 
agaaaaggac 
taagcagt ta 
gatganagay 
tgtcagtyat 
ggaggcagag 
acactacaaa 
atgctaggtt 
aagaagaatn 
agaaggtggg 



aatgctgaag 
gatgaggctc 
t taaagaagt 
aagtyactcg 
acccaggaaa 
cgaatyggaa 
caagagacyc 
atagcycact 
tcaactggaa 
ggtgaatgag 
ctgagcaagc 
ggaggcgntt 



tgaagtttaa 
tttgtgttgt 
caagtaaggg 
ctgctgtgaa 
tgatgacaga 
cattagaaaa 
aacagatgea 
tgnagcagga 
agcaagcagt 
cctgactgag 
agctactcag 
cagngcctac 



<210> 39 

<211> 571 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> unsure 
<222> 37 . . 37 
<223> n = a, c, g or t 

<220> 

<221> unsure 
<222> 42 . . 42 
<223> n = a, c, g or t 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
730 
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<220> 

<221> unsure 
<222> 64 . . 64 
5 <223> n = a, c, g or t 

<220> 

<221> unsure 

<222> 67 . .67 

10 <223> n = a, c, g or t 

<220> 

<221> unsure 

<222> 73. .73 

15 <223> n = a, c, g or t 

<220> 

<221> unsure 

<222> 86. . 99 

20 <223> n = a, c, g or t 

<220> 

<221> unsure 
<222> 106. . 107 
25 <223> n - a, c, g or t 

<220> 

<221> unsure 
<222> 117 . . 117 
30 <223> n = a, c, g or t 

<220> 

<221> unsure 

<222> 139. . 139 

35 <223> n = a, c, g or t 

<220> 

<221> unsure 

<222> 153. . 153 

40 <223> n - a, c, g or t 

<220> 

<221> unsure 
<22.2> 243. .243 
45 <223> n = a, c, g or t 

<220> 

<221> unsure 

<222> 257 . . 257 

50 <223> n = a, c, g or t 

<220> 

<221> unsure 
<222> 264 . .264 
55 <223> n = a, c, g or t 

<220> 

<221> unsure 
<222> 319. . 319 
60 <223> n - a, c, g or t 

<220> 

<221> unsure 
<222> 337 . . 337 
65 <223> n = a, c, g or t 

<220> 
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<221> unsure 

<222> 341. .341 

<223> n - a, c, g or t 

5 <220> 

<221> unsure 

<222> 353. .353 

<223> n = a, c, g or t 

10 <220> 

<22 1> unsure 

<222> 362. .362 

<223> n = a, c, g or t 

15 <220> 

<221> unsure 

<222> 373. .373 

<223> n « a, c, g or t 

20 <220> 

<221> unsure 

<222> 379. . 379 

<223> n - a, c, g or t 

25 <220> 

<221> unsure 

<222> 384 . . 385 

<223> n = a, c, g or t 

30 <220> 

<221> unsure 

<222> 419. .419 

<223> n = a, c, g or t 

35 <220> 

<221> unsure 

<222> 421. .421 

<223> n = a, c, g or t 

40 <220> 

<221> unsure 

<222> 427 . .427 

<223> n = a, c, g or t 

45 <220> 

<221> unsure 

<222> 433.433 

<223> n = a, c, g or t 

50 <220> 

<221> unsure 

<222> 440. . 442 

<223> n = a, c, g or t 

55 <220> 

<221> unsure 

<222> 444 . . 444 

<223> n = a, c, g or t 

60 <220> 

<221> unsure 

<222> 448. .448 

<223> n = a, c, g or t 

65 <220> 

<221> unsure 
<222> 450. .450 
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<223> n = a, c, g or t 
<220> 

<221> unsure 
5 <222> 455. .455 

<223> n = a, c, g or t 

<220> 

<221> unsure 
10 <222> 486. .486 

<223> n = a, c, g or t 

<220> 

<221> unsure 
15 <222> 488 . .489 

<223> n = a, c, g or t 

<220> 

<221> unsure 

20 <222> 493. .493 

<223> n = a, c, g or t 

<220> 

<221> unsure 

25 <222> 496. . 497 

<223> n — a, c, g or t 

<220> 

<221> unsure 
30 <222> 500. .500 

<223> n = a, c, g or t 

<220> 

<221> unsure 
35 <222> 504 . . 504 

<223> n = a, c, g or t 

<220> 

<221> unsure 

40 <222> 506. .507 

<223> n = a, c, g or t 

<220> 

<221> unsure 
45 <222> 515. .515 

<223> n = a, c, g or t 

<220> 

<221> unsure 
50 <222> 520. .520 

<223> n — a, c, g or t 

<220> 

<221> unsure 
55 <222> 523. . 523 

<223> n = a, c, g or t 

<220> 

<221> unsure 
60 <222> 525. .526 

<223> n = a, c, g or t 

<220> 

<221> unsure 
65 <222> 538. .538 

<223> n = a, c, g or t 
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<400> 39 

aayacaatgc cttttttatc ctcaaataga rcagttnccc anttacttca ctcttacacc 60 

tggnagngct ctntctcctt tgtgrnnnnn nnnnnnnnnc ccccynnccg aatgqtnctq 120 

taactgtgtt acagtctgnt ttaggcttac agncattgtc tccttctcag gaggctccgt 180 

ttctggggga acatcacttt tttcaataac agtagtgtct cccgcagctt ttaotattt t 240 

tgnctggata agctccngtg actnttgagc cttatgcaaa tcaccagcta ccttctgtct 300 

ttcattttgt tcagttctna gttt tgtggg tgtttcnttt nactgtgccy tcnactctct 360 

gncttctgta acntagggng gtcnntccct ctctgccttt tctagttccc tttcccaang 420 

ntctcgntct ccnccccagn nngncaanan ccgcnatttt tcgcttctta attcggtcta 480 

gccccngnnc cgnagnnggn aaangnnggc tgcanccggn ttnannaggc ctgtqtqntt 54 0 

catcggcctt tacctttccc tttattttct t 571 

<210> 40 

<211> 644 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> unsure 

<222> 415. .415 

<223> n = a, c, g or t 

<220> 

<221> unsure 

<222> 525 . . 526 

<223> n = a, c, g or t 



<400 
atattttgga 
cagcccagac 
ataagcagat 
aagaagagga 
aattacaggc 
acagtgttta 
tttcaaacaa 
aagttcagaa 
tggaaaaatg 
ggacttattc 
acaaagagtt 



> 40 
gcagaatgag 
ctccgcttca 
aaaacagact 
acttaaggat 
cctggcaaat 
tgttaaagat 
aatggaagaa 
gctacagact 
cattcaagaa 
aggtggcaac 
caagacttac 



get t tgaaag 
gttctagcag 
gaagattctt 
atacagaata 
gagcaggctg 
gataaaataa 
tttaagattc 
cttgtttctg 
aagatgagaa 
taaagagagg 
agctagcaaa 



ctcaaattca 
aagaat taca 
tagcaagtga 
tgaatttctt 
ctgctgcaca 
gattgctgga 
taaatgacca 
aacagectaa 
gttaaagact 
agetgatgea 
tgatcaggtt 



<210> 41 

<211> 566 

<212> DNA 

<213> Homo sapiens 



gcagttccat 
taaagtgatt 
acgtgatcgt 
att aaaagct 
tgaattggag 
agagcaacta 
aaacaaagca 
taaggatgtt 
gtggnngatt 
tagacagaca 
cttg 



tcccagatag 
gcagaaaagg 
ttaacaagta 
gaagtgcaga 
aagatgeaac 
caacatgaca 
ttaanatcag 
gtggaacaca 
acttgacact 
ttcatctctg 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
644 



<220> 

<221> unsure 
<222> 47.47 

<223> n ~ a, c, g or t 
<220> 

<221> unsure 
<222> 59. . 59 
<223> n = a, c, g or t 

<220> 

<221> unsure 
<222> 91. . 91 
<223> n = a, c, g or t 



<220> 

<221> unsure 

<222> 104 . . 104 

<223> n = a, c, g or t 



<220> 

<221> unsure 
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<222> 441 . .441 

<223> n = a, c, g or t 

<220> 

5 <221> unsure 

<222> 448 . .448 

<223> n = a, c, g or t 

<220> 

10 <221> unsure 

<222> 457 . . 457 

<223> n - a, c, g or t 

<220> 

15 <221> unsure 

<222> 525 . . 525 

<223> n = a, c, g or t 

<220> 

20 <221> unsure 

<222> 539. .539 

<223> n =* a, c, g or t 



<220> 

25 <221> unsure 

<222> 564 . . 564 
<223>n=a, c, gort 

<400> 41 

caacacaatc tcaaggtctt caaactggga tgcagaagag gcctctntca agttatgtng 



30 



35 



40 



45 



50 



55 



60 



65 



tgctacttcc tgaacaaggg cttgaaaaga ntcaatggct tctngaagag cccgtaacca 120 

cttccccgta ttggctaggg ctttctcttt ctcttccaaa acagccttca tgggattcat 180 

ctgtgcctct tttcctttta ataagttcyg aagctcctga actttggaag tgatagataa 240 

cygttgagcc ttttcaagct ggacatttcc tatttcttct tttagagcct ttatttcccg 300 

tttcaaatcc cgaacagttt tctccttgtt agcaacattg agaagtgccg cctccagaag 360 

ctcttctaca gacttgatct ttcacctctc tcatggatca ctctcttaag gtcttcaact 420 

agagaggcaa aagaacccga ncatttgnta gttttangct cgaacttctt ttgtcagaga 480 

tgaattttct gttcttattg cattcagctc ctcttcttta ggtgncacca gaataaggnc 540 

agtgtcaaga aattcttcca cagnct 566 



<210> 42 

<211> 878 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> unsure 

<222> 595. .595 

<223> n = a, c, g or t 

<220> 

<221> unsure 

<222> 756. .756 

<223> n = a, c, g or t 

<220> 

<221> unsure 

<222> 766. .766 

<223> n = a, c, g or t 

<220> 

<221> unsure 

<222> 815. .815 

<223> n = a, c, g or t 

<220> 

<221> unsure 
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<222> 818 . .818 

<223> n = a, c, g or t 

<220> 

<221> unsure 

<222> 827 . .827 

<223> n = a, c, g or t 

<220> 

<221> unsure 

<222> 831. .831 

<223> n = a, c, g or t 



<220> 
<221> 
<222> 
<223> 



unsure 
853. .853 
n = a, c, 



g or t 



<220> 

<221> unsure 

<222> 866. .866 

<223> n ~ a, c, g or t 

<400> 42 

accttaacct cctctgacct gaagttccag ccatacattc ttttgccttt ttttcaatcc 60 

atgcaaccat tcaccataac tcaaattaga agggacagac acctttggaa ataatttttt 120 

gagaacttct ttagcctcca actcaacagc ttccacctgt tgctgccttt cattgttttt 180 

cttcctctgg tgttcaactg catccttcaa agaatctaac tcattccaga gaccacttat 240 

ttctttctct ctttctgaaa ttacttttaa taattcttca tgagggggaa aagaagatgc 300 

ctgttggtag ttttgttgtt taagctgctc aatttgggac ttaaacaatt tgttttcatc 360 

ttgtacatcc tgtaacagct gtgttttgct agaaagatca ctctccctct cttttagcat 420 

ggcttctaac ctcttcaatt cattttcctt ttctttcaac acaatctcaa gttcttcaaa 480 

ctgtgatgca gaagaggcct ctttcaagtt atgttgtgct acttcctgaa catgtgcttt 540 

taaaagattc attttcttct tgaagatcct gtaaccactt ccctgtattg gctangtctt 600 

tctctttctc ttccaaaaca gccttcaagg tattcatctg ttcctctttt ccttttaaaa 660 

agttctgaag ctcctgaact ttggaagtga tagataactg ttgagccctt tcaagcctgg 720 

acattcctaa ttcctccttt aaaagcctta attccngttt caaaanccga aacagttttc 780 

ccccttgtaa gcaactttga gaaattcggc tccanaancc ctcctanaga nttgatcttt 840 

caacctttcc aanggattac tttccnaaag ttccttaa 878 

<210> 43 

<211> 620 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> unsure 

<222> 391. . 391 

<223> n = a, c, g or t 

<220> 

<221> unsure 

<222> 400. .400 

<223> n = a, c, g or t 

<220> 

<221> unsure 

<222> 411 . .411 

<223> n = a, c, g or t 

<220> 

<221> unsure 

<222> 414 . . 414 

<223> n = a, c, g or t 



<220> 

<221> unsure 
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<222> 418 .. 418 

<223> n = a, c, g or t 

<220> 

5 <221> unsure 

<222> 421 . .421 

<223> n - a, c, g or t 

<220> 

10 <221> unsure 



<222> 427 . .428 

<223> n = a, c, g or t 



<220> 

15 <221> unsure 



<222> 435. .435 

<223> n = a, c, g or t 



<220> 

20 <221> unsure 



<222> 446. .446 

<223> n = a, c, g or t 



<220> 

25 <221> unsure 



<222> 454 . .455 

<223> n = a, c, g or t 



<220> 

30 <221> unsure 



<222> 463. .463 

<223> n = a, c, g or t 



<220> 

35 <221> unsure 



<222> 465. .465 

<223> n = a, c, g or t 



<220> 

40 <221> unsure- 



<222> 471 . . 471 

<223> n = a, c, g or t 



<220> 

45 <221> unsure 



<222> 480. . 480 

<223> n - a, c, g or t 



<220> 

50 <221> unsure 



<222> 484 . .484 

<223> n = a, c, g or t 



<220> 

55 <221> unsure 



<222> 490. .490 

<223> n = a, c, g or t 



<220> 

60 <221> unsure 



<222> 493. .493 

<223> n - a, c, g or t 



<220> 

65 <221> unsure 



<222> 500. .500 

<223> n = a, c, g or t 
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<220> 

<221> unsure 
<222> 514 . . 515 
<223> n = a, c, 



g or t 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



<220> 

<221> unsure 

<222> 524 . .524 

<223> n = a, c, g or t 

<220> 

<221> unsure 

<222> 539. .539 

<223> n = a f c, g or t 



<220> 

<221> unsure 
<222> 568. .568 
<223> n = a, c, 



<220> 

<221> unsure 
<222> 573 . . 574 
<223> n = a, c, 

<220> 

<221> unsure 
<222> 581 . . 581 
<223> n = a, c, 

<220> 

<22 1> unsure 
<222> 600. . 600 
<223> n = a, c, 

<220> 

<221> unsure 
<222> 604.. 604 
<223> n = a, c, 

<220> 

<221> unsure 
<222> 608 . . 608 
<223> n = a, c, 

<220> 

<221> unsure 
<222> 616. . 616 
<223> n = a, c, 

<220> 

<221> unsure 
<222> 619. . 619 
<223> n = a, c, 



<400> 
accttaacct 
catgcaacca 
tgagaacttc 
tcttcctctg 
tttctttctc 
cctgttggta 
cctgtacatc 
nggggtnnca 
caangggtan 
ctttaagatc 



43 

cctctgaccc 
ttcaccataa 
tttagcctcc 
gtgttcaact 
tctttctgaa 
gttttgttgt 
ctgtaacagc 
acctncttaa 
tgnaaaaaan 
attttctctg 



g or t 



g or t 



g or t 



g or t 



g or t 



g or t 



g or t 



g or t 



tgaattccag 
ctcaaattag 
aactcaacag 
gcatccttca 
attactttta 
ttaagctgct 
tgtgttttgc 
atacanttcc 
gcccctttca 
aagatccngt 



ccatacattc 
aagggacaga 
cttccacctg 
aagaatctaa 
ataattcttc 
caatttggga 
naagaaagtn 
tttnncttta 
agtnnggttg 
acnnttcccg 



ttttgccttt 
cacctttgga 
ttgctgcctt 
ctcattccag 
atgaggggga 
cttaaacaat 
actccccctc 
aananaatct 
tgcnacttcc 
nattggtagg 



ttttcaaatc 
aataattttt 
tcattgtttt 
agaccactta 
aaagaagatg 
ttgttttcat 
nccnttanca 
naggttcctn 
tgaacatgng 
cttcccttcn 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
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ttcnaaanag cttcanggng 

<210> 44 

<211> 623 

5 <212> DNA 

<213> Homo sapiens 

<220> 

<221> unsure 
10 <222> 25. .25 

<223> n = a, c, g or t 

<220> 

<221> unsure 

15 <222> 409. . 409 

<223> n = a, c, g or t 

<220> 

<221> unsure 

20 <222> 412 . . 412 

<223> n = a, c, g or t 

<220> 

<221> unsure 
25 <222> 414 .. 414 

<223> n = a, c, g or t 

<220> 

<221> unsure 
30 <222> 423. .423 

<223> n = a, c, g or t 

<220> 

<221> unsure 
35 <222> 426. .426 

<223> n = a, c, g or t 

<220> 

<221> unsure 

40 <222> 435. .435 

<223> n = a, c, g or t 

<220> 

<221> unsure 

45 <222> 444 .. 444 

<223> n - a, c, g or t 

<220> 

<221> unsure 

50 <222> 450. . 450 

<223> n = a, c, g or t 

<220> 

<221> unsure 

55 <222> 454 . . 454 

<223> n = a, c, g or t 

<220> 

<221> unsure 
60 <222> 459. .459 

<223> n = a, c, g or t 

<220> 

<221> unsure 

65 <222> 490. .490 

<223> n = a, c, g or t 



620 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



<220> 

<221> unsure 

<222> 508 . .508 

<223> n = a, c, g or t 



<220> 
<221> 
<222> 
<223> 



unsure 
511- .511 
n - a, c, 



g or t 



<220> 

<221> unsure 
<222> 515. .515 
<223> n - a, c, 

<220> 

<221> unsure 

<222> 524 . . 524 

<223> n = a, c, 

<220> 

<221> unsure 
<222> 545. .545 
<223> n = a, c, 

<220> 

<221> unsure 

<222> 555. .555 

<223> n = a, c, g or t 



g or t 



g or t 



g or t 



<220> 
<221> 
<222> 
<223> 



unsure 
564. .564 
n = a, c, 



<220> 

<221> unsure 
<222> 600. . 600 
<223> n = a, c, 



<220> 
<221> 
<222> 
<223> 



unsure 
611. .611 
n = a, c, 



g or t 



g or t 



g or t 



<400> 44 

accttaacct cctctgaccc tgaantccag ccatacattc ttttgccttt ttttcaaatc 
catgcaacca ttcaccataa ctcaaattag aagggacaga cacctttgga aataattttt 
tgagaacttc tttagcctcc aactcaacag cttccacctg ttgctgcctt tccttggaag 
tcttgttcac tttgtcctgc agcatttttt cagttgatgc caatgcttcc attgcttccc 
agtttttctc ccgaaggtca ttgtttttct tcctctggtg ttcaactgca tccttcaaag 
aatctaactc attccagaga ccacttattt ctttctctct ttctgaaatt acttttaata 
attcttcatg agggggaaaa gaagatgcct ggtgggaatt tggtggttna ancnggccaa 
ttnggnacct aaaanatttg gttncaactn gtanaccgng taaaaactgt gttttgctaa 
aaagataacn ccccctctct ttaagcangg nttcnaacct cttnattcat ttccttttct 
tcaanaaatc tcagntctca aacngggatg caaaagaggc tcttcaagtt aaggtgggcn 
acttccggaa natggccttt aaa 

<210> 45 

<211> 625 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> unsure 

<222> 392. .392 

<223> n = a, c, g or t 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
623 
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<220> 

<221> unsure 
<222> 408 . . 408 
5 <223> n = a, c, g or t 

<220> 

<221> unsure 
<222> 417 . .417 
10 <223> n = a, c, g or t 

<220> 

<221> unsure 

<222> 421 . .421 

15 <223> n = a, c, g or t 

<220> 

<221> unsure 

<222> 424 . .425 

20 <223> n = a, c, g or t 

<220> 

<221> unsure 

<222> 437 . .437 

25 <223> n = a, c, g or t 

<220> 

<221> unsure 

<222> 439. .439 

30 <223> n = a, c, g or t 

<220> 

<221> unsure 
<222> 456. . 456 
35 <223> n - a, c, g or t 

<220> 

<221> unsure 
<222> 467 . .467 
40 <223> n = a, c, g or t 

<220> 

<221> unsure 
<222> 469. .469 
45 <223> n = a, c, g or t 

<220> 

<221> unsure 
<222> 473. .473 
50 <223> n = a, c, g or t 

<220> 

<221> unsure 
<222> 475. .475 
55 <223> n = a, c, g or t 

<220> 

<221> unsure 
<222> 496. .496 
60 <223> n = a, c, g or t 

<220> 

<221> unsure 
<222> 499. .500 
65 <223> n = a, c, g or t 

<220> 
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<221> unsure 
<222> 516. .516 
<223> n ~ a, c, 



g or t 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



<220> 

<221> unsure 

<222> 524 . .524 

<223> n = a, c, g or t 

<220> 

<221> unsure 

<222> 535. .535 

<223> n = a, c, g or t 

<220> 

<221> unsure 

<222> 538 . . 538 

<223> n = a, c, g or t 

<220> 

<221> unsure 

<222> 560. .560 

<223> n = a, c, g or t 

<220> 

<221> unsure 

<222> 582. .582 

<223> n = a, c, g or t 

<220> 

<221> unsure 

<222> 592. .592 

<223> n - a, c, g or t 

<220> 

<221> unsure 

<222> 594 . . 594 

<223> n = a, c, g or t 

<220> 

<221> unsure 

<222> 604 . . 604 

<223> n = a, c, g or t 



<400 
accttaacct 
catgcaacca 
tgagaacttc 
tcttcctctg 
tttctttctc 
cctgttggta 
cttgtacatc 
ncanngggtt 
tcaaactgtg 
ctttaaagat 
ttcncttcaa 



> 45 
cctctgaccc 
ttcaccataa 
tttagcctcc 
gtgttcaact 
tctttctgaa 
gttttgttgt 
ctggtaacag 
caacccntna 
atgcanaann 
catttcttcn 
aaaagcctca 



tgaagtccag 
ctcaaattag 
aactcaacag 
gcatccttca 
atta'ctttta 
ttaagctgct 
ctgtgttttg 
aattacattt 
ggctctttca 
tgaaatccgt 
aggta 



ccatacattc 
aagggacaga 
cttccacctg 
aagaatctaa 
ataattcttc 
caatttggga 
cntagaaaag 
cccttntctt 
agttangttg 
aaccacttcc 



ttttgccttt 
cacctttgga 
ttgctgcctt 
ctcattccag 
atgaggggga 
cttaaacaat 
attactcncc 
aaacaanant 
tgcnacttcc 
cngtatgggc 



ttttcaaatc 
aataattttt 
tcattgtttt 
agaccactta 
aaagaagatg 
ttgttttcat 
ctctctnttt 
ctnangttct 
tgaanatntg 
angncttccc 



<210> 46 

<211> 4416 

<212> DNA 

<213> Homo sapiens 

<400> 46 

gccgcacagg gttttatagg atcacattga caaaagtacc atggagtttt atgagtcagc 

atattttatt gttcttattc cttcaatagt tattacagta attttcctct tcttctggct 

tttcatgaaa gaaacattat atgatgaagt tcttgcaaaa cagaaaagag aacaaaagct 

tattcctacc aaaacagata aaaagaaagc agaaaagaaa aagaataaaa agaaagaaat 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
625 



60 
120 
180 
240 
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ccagaatgga aacctccatg aatccgactc tgagagtgta cctcgagact ttaaattatc 300 

agatgctttg gcagtagaag atgatcaagt tgcacctgtt ccattgaatg tcgttgaaac 360 

ttcaagtagt gttagggaaa gaaaaaagaa ggaaaagaaa caaaagcctg tgcttgaaga 420 

gcaggtcatc aaagaaagtg acgcatcaaa gattcctggc aaaaaagtag aacctgtccc 480 

5 agttactaaa cagcccaccc ctccctctga agcagctgcc tcgaagaaga aaccagggca 540 

gaagaagtct aaaaatggaa gcgatgacca ggataaaaag gtggaaactc tcatggtacc 600 

atcaaaaagg caagaagcat tgcccctcca ccaagagact aaacaagaaa gtggatcagg 660 

gaagaagaaa gcttcatcaa agaaacaaaa gacagaaaat gtcttcgtag atgaacccct 720 

tattcatgca actacttata ttcctttgat ggataatgct gactcaagtc ctgtggtaga 780 

10 taagagagag gttattgatt tgcttaaacc tgaccaagta gaagggatcc agaaatctgg 840 

gactaaaaaa ctgaagaccg aaactgacaa agaaaatgct gaagtgaagt ttaaagattt 900 

tcttctgtcc ttgaagacta tgatgttttc tgaagatgag gctctttgtg ttgtagactt 960 

gctaaaggag aagtctggtg taatacaaga tgctttaaag aagtcaagta agggagaatt 1020 

gactacgctt atacatcagc ttcaagaaaa ggacaagtta ctcgctgctg tgaaggaaga 1080 

15 tgctgctgct acaaaggatc ggtgtaagca gttaacccag gaaatgatga cagagaaaga 1140 

aagaagcaat gtggttataa caaggatgaa agatcgaatt ggaacattag aaaaggaaca 1200 

taatgtattt caaaacaaaa tacatgtcag ttatcaagag actcaacaga tgcagatgaa 1260 

gtttcagcaa gttcgtgagc agatggaggc agagatagct cacttgaagc aggaaaatgg 1320 

tatactgaga gatgcagtca gcaacactac aaatcaactg gaaagcaagc agtctgcaga 1380 

20 actaaataaa ctacgccagg attatgctag gttggtgaat gagctgactg agaaaacagg 1440 

aaagctacag caagaggaag tccaaaagaa gaatgctgag caagcagcta ctcagttgaa 1500 

ggttcaacta caagaagctg agagaaggtg ggaagaagtt cagagctaca tcaggaagag 1560 

aacagcggaa catgaggcag cacagcaaga tttacagagt aaatttgtgg ccaaagaaaa 1620 

tgaagtacag agtctgcata gtaagcttac agataccttg gtatcaaaac aacagttgga 1680 

25 gcaaagacta atgcagttaa tggaatcaga gcagaaaagg gtgaacaaag aagagtctct 1740 

acaaatgcag gttcaggata ttttggagca gaatgaggct ttgaaagctc aaattcagca 1800 

gttccattcc cagatagcag cccagacctc cgcttcagtt ctagcagaag aattacataa 1860 

agtgattgca gaaaaggata agcagataaa acagactgaa gattctttag caagtgaacg 1920 

tgatcgttta acaagtaaag aagaggaact taaggatata cagaatatga atttcttatt 1980 

30 aaaagctgaa gtgcagaaat tacaggccct ggcaaatgag caggctgctg ctgcacatga 2040 

attggagaag atgcaacaaa gtgtttatgt taaagatgat aaaataagat tgctggaaga 2100 

gcaactacaa catgaaattt caaacaaaat ggaagaattt aagattctaa atgaccaaaa 2160 

caaagcatta aaatcagaag ttcagaagct acagactctt gtttctgaac agcctaataa 2220 

ggatgttgtg gaacaaatgg aaaaatgcat tcaagaaaaa gatgagaagt taaagactgt 2280 

35 ggaagaatta cttgaaactg gacttattca ggtggcaact aaagaagagg agctgaatgc 2340 

aataagaaca gaaaattcat ctctgacaaa agaagttcaa gacttaaaag ctaagcaaaa 2400 

tgatcaggtt tcttttgcct ctctagttga agaacttaag aaagtgatcc atgagaaaga 2460 

tggaaagatc aagtctgtag aagagcttct ggaggcagaa cttctcaaag ttgctaacaa 2520 

ggagaaaact gttcaggatt tgaaacagga aataaaggct ctaaaagaag aaataggaaa 2580 

40 tgtccagctt gaaaaggctc aacagttatc tatcacttcc aaagttcagg agcttcagaa 2640 

cttattaaaa ggaaaagagg aacagatgaa taccatgaag gctgttttgg aagagaaaga 2700 

gaaagaccta gccaatacag ggaagtggtt acaggatctt caagaagaaa atgaatcttt 2760 

aaaagcacat gttcaggaag tagcacaaca taacttgaaa gaggcctctt ctgcatcaca 2820 

gtttgaagaa cttgagattg tgttgaaaga aaaggaaaat gaattgaaga ggttagaagc 2880 

45 catgctaaaa gagagggaga gtgatctttc tagcaaaaca cagctgttac aggatgtaca 2940 

agatgaaaac aaattgttta agtcccaaat tgagcagctt aaacaacaaa actaccaaca 3000 

ggcatcttct tttccccctc atgaagaatt attaaaagta atttcagaaa gagagaaaga 3060 

aataagtggt ctctggaatg agttagattc tttgaaggat gcagttgaac accagaggaa 3120 

gaaaaacaat gaaaggcagc aacaggtgga agctgttgag ttggaggcta aagaagttct 3180 

50 caaaaaatta tttccaaagg tgtctgtccc ttctaatttg agttatggtg aatggttgca 3240 

tggatttgaa aaaaaggcaa aagaatgtat ggctggaact tcagggtcag aggaggttaa 3300 

ggttctagag cacaagttga aagaagctga tgaaatgcac acattgttac agctagagtg 3360 

tgaaaaatac aaatccgtcc ttgcagaaac agaaggaatt ttacagaagc tacagagaag 3420 

tgttgagcaa gaagaaaata aatggaaagt taaggtcgat gaatcacaca agactattaa 3480 

55 acagatgcag. tcatcattta catcttcaga acaagagcta gagcgattaa gaagcgaaaa 3540 

taaggatatt gaaaatctga gaagagaacg agaacatttg gaaatggaac tagaaaaggc 3600 

agagatggaa cgatctacct atgttacaga agtcagagag ttgaaggcac agttaaatga 3660 

aacactcaca aaacttagaa ctgaacaaaa tgaaagacag aaggtagctg gtgatttgca 3720 

taaggctcaa cagtcactgg agcttatcca gtcaaaaata gtaaaagctg ctggagacac 3780 

60 tactgttatt gaaaatagtg atgtttcccc agaaacggag tcttctgaga aggagacaat 3840 

gtctgtaagt ctaaatcaga ctgtaacaca gttacagcag ttgcttcagg cggtaaacca 3900 

acagctcaca aaggagaaag agcactacca ggtgttagag tgaagtaatt gggaaactgt 3960 

tcatttgagg ataaaaaagg cattgtatta tattttgcca aattaaagcc ttatttatgt 4020 

tttcaccctt tctactttgt cagaaacact gaacagagtt ttgtcttttc taatccttgt 4080 

65 tagactactg atttaaagaa ggaaaaaaaa aagccaactc tgtagacacc ttcagagttt 4 140 

agttttataa taaaaactgt ttgaataatt agacctttac attcctgaag ataaacatgt 4200 

aatcttttat cttattttgc tcaataaaat tgttcagaag atcaaagtgg taaagacaat 4260 
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gtaaaattta acattttaat actgatgttg tacactgttt tacttaacat tttgggaaqt 4320 
aactgcctct gacttcaact caagaaaaca cttttttgtt gctaatgtaa tcggtttttg 4380 
taatggcgtc agcaaataaa aggatgctta ttattc 4416 
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5EOIJEHCF. LISTING 

<110> Ludwig 2A3titul:« for (^nncer tt&eear'<:]i 

5 <L2l]> Cflri^Kr Assoudat d Antigens 

and Uees Thftreror 

<130> L04ei/"?O50HO 

CO <14Q:> H£ 05/166, 300 

<141> 

<140> US 03/lGG r 35O 

L5 

<17f» Fast£Si$Q for windows Version *S.O 

<211> 4422 

<212> DWA 

<212> Homo sapiens 



25 <4UU> 2 

gattattDtg gaggaagatg accttcattt: tatggttatg tgtgtgtgtc tgtgtgtctt 60 

tggaaaaatn t n tatatft t tt.tr^^t-.^j q.>?qw-rs^ca tntcnngttfot tganttaaag ^20 

tatgctgcag aatatatatt. ctaaaactac aaaaaagtca ct^aataL^a ^daLgdtACti ISO 

gcttatacat atagttactg C gncaci^tc?^ c^qactqcta gttcagaatt caaanntcct 24 0 

30 M.i:£i:a4i.i.t. yt.gagaLaai: gggctaaatt cctt ctg^t L gtcautgygg <:«a«gMAaL 300 

tgctttagtt tttgatgaga gttcttagas gtttgttggt attocttcat ccacagcatc 360 

cattgttgaa ntanccattt t w rjt t c?"tqo» t-cjix^t-t-o ci Mq^aqccn^ tt-qt-ta^cot d^u 

yArtrtl.^rtrtL m.iggLagcc agtttcaatg aagctagaga ttagtcagaa aaagttagct 4 BO 

gttgggcttt agaaagggat. tttgagtcct gncatttcta cttgggagca ttttggagca 5*0 

3-3 rjatta^qctt bcarrt-ftt^ti^ HHiiHfettLQyt: Lat^LyALgy Aia«<:l-.KL'J-.i:fc: Kw<j;<:i.-Lt_ji Ld 600 

gggaaacLga gceraagctg aatgatattg tctgctgcaa aaaaaaacaa acaaaaaaaa CCD 

aacaafidcaa aaaacaaaaa n hm o i\n i» « ^ « fi yct-^qt- qccqaattcq ircacqugrjyq 720 

aagtf;c:<igl\y uat^r^ccgc ccrtrcggccg ccgccgcccc gctggccctg cagccgtcgc 780 

cgctgcctcg ggctacagcc ccgggotcqq crjrjt.cccgcjc tggggaagga gggcqqc^oq 64 0 

40 cgcqtccgga gccgccgq^q *jhijgr_y gyay ggiia^li^cgg K<:<:^i: guyg^grgg 500 

ccggcagcg^ cgagatcgtc caactgaacg taggggggac cagatttagt acctcaagac 9 60 

aaactcttat gtggattcca qat t ot t tt tttwaqtt-t r*ct q* qt.qqq ag* **hkfc:L:a.A 1&20 

t fjej<»q# bqpnpctgql. QCtaCattta tt gat a gaga tceagcagca tttgcaccca 10 (SO 

ttttaaattt tcrttcggaca aaagaactag acttaagggg agtgagtatt aatgttstr\n llflU 

45 ggca t gaagc agaattttac ytjqjil-.r:ni;li^ i:at_l:agtwidy aayy t:LLc:l:<: t:L:w Lgtgaag 1200 

aattggag£3 tLcc r.^ttgt ggcagtgtcc tttttcatgg ttacttgcce ccaccsggta 12613 

ttcctagtcg taaaataaac aa<jacngtcn qntctgctgn ttctaggaat ggt ctaodtt 1320 

ct oica gaotgq tgaacr^ccqq ana C. qq 1. a cacagtetgL U:1:n l.^fcyya. rt^iygg^gaag 13B0 

aaacLgtlig gct&ggattt cctgtggatc cacgaaaggc gctaatagta gctggccatc 14^0 

50 acaactggat tgtagctgca tc>tgci; i- a 1 1. t.hgi;tyngtq ^ta^garstc: o>^ijM»t.i:t. I. 1500 

caggatggca gcsegtgttt acgagDccat atttggattg gscrtatcgaa cgagtagctt 15f.n 

taaatgcaaa ggtggttgga ^c?qcc<?c^tq q^q^Maaqa can*>ntqcrtt qct^ttq«r-1. 1 &20 

onijrtq?igtnq c«tcfltr^ttq i.gg^gtgUtc aggatgg^g ^ft^Lggaagt gAAattggag 1680 

tgttcagcct gggtgttcct gtagatgctD tcttctttat tggtaaccag ttggtggcoa 

5& cgagtcatac agggaaagtg ggc^qtqt-qqji «tqct.qtcac tc*?qc^ctqrr c^qqttcz.«y 1&00 

At:ijl:tqttw t^t^ritMcjl. i:atgacactg ctggatc^Lt ccttctgctt ggatgtaaoa 

atggatcaat atattacata gatatgcaga agttcccctt g<jgaatgaan gatant-^ti? 1 S20 

ttcttgtaac tgaactgtpt cfltq^tvc^l: 4:aa& k<j,^* b<j<i l;rti.hj?^;K.qi;S. r2h<j^gtgtt.t 1W0 

Ao^t^cir-flw yooa^tn-ysyl. gtcagtggta actggatcga gatcgcctst ggtacgagct 20fl0 

60 ctggagcagt a^g&gtgatt gtacaacaDD cagagacagt tgggtDaggt cctcagcttt 2H30 

ttcagacttt cacagttcac cgaagtc t;oq taofoaaaaat c^tgttntcyi r^flqflyoc^r.f: 2160 

ttr^tatcagt ctgtgcagat. cj » K«s L L g Lccgg^^Lg g^ c&q C d ei c a cgattcngag 2230 

gaatgatfil:^ 1:ArLcagct:a ggttctBDtc ctttagcgtc attcaagata ctatcccttTrj 22 B0 

aggagacaga aagtcatggt agct^ttTCt ctqq^^tg^ c vt nqq n r?ot tt-t-qii^q&gc; 234 0 

<ja.qacgoLtcG acaggtqttt ^t-<:o^aaag ttgttcccat caccaacaaa ctatttgtaa 24 On 

gactcLcAfc^ gaLtggaaaa agaatatgtg agatccaggc tgttga^tgt actacaat.it 2 0 60 

cctcatttac agtgagggaa tgtgsgggnt cc-agtagq^K qqqiii.i'am^ txv>flf?tjoyi.:L 2520 
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ID 



15 



2ft 



25 



30 



35 



<0 



45 



5ft 



,15 



6a 



acttgttcac 
ntatggttaa 

caacttccgt 
acccttgatac 
ctttat-tano 
ttaatttgaa 
ctagctgaagt 

gtttttcaga 
gqann<a#agq 
citcatcacc 
ctacaaagac 
get tgtgaan 

catttagtat 
ttge*c=atttt 
ggsgtccatt 
taggatggtg 

ttgtctttgg 
tgatgei^qtq 
aataccagat 
tttt ctgatt 
c tia a a m l: La<: 
aaaataataa 
a cctgt ct tg 
1 1 a a a i\ t a a a 
gttttggctg 
yttttt-gtsK? 
aattgtaaat 
saatataa^t 



aggtrcataca 
caaaagtgaa 

agttragcat 
cacccatgaa 
OLagggcnngn 

gaaaggggca 
*a.qtttqnq.3 
atccaagaaa 

aai3l.A^Lts:c 
tactccatct 
actcaccaaa 

ctttttaaca 
q qri □ attttt 
ttaattactta 
cccatatagg 
pttat-yuut-g 
aacttggtct 
ct atgtttgt 
acca^tatgt 
ctcatcattg 
^i:l-q»AA<- 1: 1: 
tcaagggcaa 
ttt cnggagc 
aencCflgt-tt 
ctgtaagaat 
Mtttttttt 
ttaaagcatt 
gagtgaagtt 



aaUgyr^gto 
gataaggatg 
ttgagcacat 
agt<!Mi?tt5t 
gcagctactt 
nggactgaga 
g^eaqaqctg 
acaggggast 
qanttggata 

agg LCAh.<:«g 

gggggaggat 

cctcggcaLa 
atgaatagtt 

gaattacttg 
ttanttttac 
ccgar.1: 1: 1. LI; 
tgagcatccc 
cttqtct 
coca ac act t 
Qaaaaacata 
agtaagtrtt 
qgagatcttn 
LcaagLLgga 
agctttgagt 
caccatqtt t 
gf.t.eaotLgt 
gtgaat.tt.ga 

agcatttatt 
gnaataaaat 



<210> 2 

<2ll> 502 

<212> DNA 

<213> liomo sapiens 



<400> 
get, 3 get tog 
cctgttcttt 
tqct ttCCC C 

gccatLtacc 
caaaatacca 
gttggat.^y 
ctcaagcagt 
att<jcttggc 
atgtgtttcL 



gcgcqq^t^ri 

gcgccagcgg 
aaaatgagt g 
ct Lo a L a gg a 
gctgttactt 

aaggatgaag 
□gagaagtac 



i. % t.gggc:gh<:<: 
gaaccactat 
t tngt.njtcoo 
agagaaataa 
atcctaaaaa 
ccfl ty whhIi: 
atcctctagc 



<zir» 3 

<2ll> 

<212> DNA 

<213> Ht>mr> AApiAiii 



<400> 3 



CCatgfcQtf gy 

ttcctccagc 
qttcjtgnccg 
ttttt rr.rrag 
caagattacc 
aaggttgctg 

agcgagaag& 
agagaagece 
a-agccMggc 
tctcccr.gLL 
caggcgggcc 



agtgtgaccg 
tgtatgagtt 

tcragtagttc 
agtcattagg 
ocnn-gcacct 
gg q » y tig 1: yA 
tgtccaacga 
ccttgcsttgc 
t. y w agaa *g 
cgcagcttca 



c^c^tg^Li.g 
ggaggatctg 
tcttctqatq 
gg^atttgaa 
tggtctgtgt 
aatcacagaa 

C0*qc<?q.?q^C 

gg<r;l:g*;ggag 
taccactgaa 
tttgcccqct 
^£flLaf:aacg 
ggcactggct 



tt caaatgtg 

11Agijti3£tCC 
ctege LgLyt: 
rjoigaatcaaa 
attygtt CMt 
get tt£«9L:nq 

tccctgaaaa 
cfLrtrtt^tato 
gtggattggdi 
ajagntqtlGSta 
tccttggaag 
ctgnctcacc 

gtttcggtta 
^ctanaactt 
g^ataatyriLi 
a^gtacattt 
lihgqtttggn 
tttagatcat 
tacaatcitta 
attgtgaLLy 
tcatgtaatt 

1 1 ag^a Lg n t 
natcttagca 
gi:MAl: t.gbct 
geccagaagg 
ttttttcagt 
aggaol.go.^q 
atgtattttg 
<iv n t rjat-ct t 
ggtgaataa I. 
ct ttaaaatt 



g h t oq q n 

ctctgcsctc 
tttct t &nga 
cttiaacahLl-. 
tgagagtaca 

tgccaccaga 
Dact.gaagaa 



gl:g^i> 
gtgga 
QGqqt 
gagtt: 
t-ccca 
ttcct 

gsggq 
tCtDt 
qyttt 

aeggt 
aactt 



attgg 
tqnaa 
yca1.,i 
ggaga 
aegga 
tgggg 
f* C +- y y 
tgctg 
ttcco 

ttCl.y 

agaag 



gg ni. t qstcc 
aaccgaa^ia 
tact cctaac 

gaggcctt.ac^ 
ttotagggac 
tgy t ft Act bq 
tgagagaaag 
*jg*:yoat^aa 
tgaaaata^a 
rt^wqnnagtt 
tggtacrtg^.g 
ttcttcaggt 

tacaagataa 
.^t-^eoatnat 
aacagatcd L 
aacat.at.tac 

cctttaagtg 

ClMM^I^t.gttt 

caaaaacact 
tttnntttaq 
agcattagca 
gt gtttgaaa 

g« A»yi: h g K a 

gttancaaca 
ttctqnottt 
aaLLgLa^Li^ 
caggtttcta 
gl.rtr.^l-.ntr^ri 

ta 



rrt-orit-tegtc 
ttggggtttt 
ccttt caeca 

t. <: g i : jj gtt I . 
cgcccttctg 
gaaqnnt^tg 
gagtttattg 
gagctcaaaa 



c?crtcycqq 
ctctgcctca 
aagatcatct 

ccctggaaat 
agaaagaaat 
agcagctaga 

ccaacagcag 
ctccagt aaa 

r^b^ert^tqq 

ccagg^gggg 



actgetatgg 
g^qctactca 

atOAg ay 

cagcttcagc 
flr^arjaaotgt c 
ttci^aga^La 
ggtccaatac 
t c 1: C C h y y.M g 
atagctgaag 

aL^gay tttrt 

ccctatDtgg 

caggagtaca 
aaagt taanc 
ail. LggacLt 
cat at etc tt 

I: \.n Laaag-Cd 

cacgtcntaa 
ngn.rst gti^t.t: 
a a t ca cat aa 

aCtgatattg 
t cgtgcgtca 
aLattaaatg 
agatgaaata 

CC0><J L: t. y r: Lrt 

atcatgataa 
tgggttsacir? 
y<:/*A jagaac 
caaatagggt 
nccit.t.< a a g- 



gLLacatggc 
ggtttttgqt 
l: g t cttc 
aagagctaga 
tttcnc*r-fl#l. 
ag&tgtggag 
ccct t n t grjc* 



t qcrcctgnag 
gcrtggctciag 
agcaciaqc-r.H: 
g»»^£,tgtgc 
aaacaatgat 
acacccagnc 
gggagct^yi- 
aaqyft<":^aga 
aggcegggag 
tgtgactqtrji 

gaqr^ghgLtC 
gt.ctataggt 



25B0 

2"? GO 

2880 

3DD0 
3060 
3120 
3160 

3300 

3 ~i 1 3 

3420 
34 I? 11 
354 0 
3 GOO 

1720 

3840 

35C0 
jn-'o 
4080 
4140 
42U0 
4260 

4380 
4427 
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aggaggctga 

ggtggtggAg 
ggcaaggttt 
aoc*<jaggtgg 

actgaaggg I. 
gtgcactctg 

cttccgstgg 
t cacagtgac 
Aqwwi^tqtg 
agattcgLtL 
qaatattgca 
aca t tt l: iW. 
ggcgcgtctg 

gALgwttgqc 

cttgctccat 
tgggcagcat 



wtcccggga 

CCCdgqL v ai:M 

acacctggga 

gtgacttcca 
cagcccctcc 
ggnctgtgcc 
Ql.i:1.r.v<tJfUiq 
gccragagaya 
tgggagccat 

tgatgcagag 
gagtcggtcg 
gw I. fc.L.t.Mtcri 
cagatgaacc 

acgctcatga 
cctqct ccgt 

tttattgata 
attaaat taa 



tqtcagLgyy 
gqtqgcagcg 

cag^.<: t< J€»qq 
gccgcctggg 
r;r;tqaaccac 
gc'ttgagw;*.^ 
caagceaggg 
Mtttcgggg 

gagttscrtgt 
tcc&ct ccng 
gggcgL.-gnr^ 
tcaccgcgag 
ciqaqaagqt c 
ttttatttaL 
teat sat aat 

gttttl.tAl.y 

ttcccttctc 

gaagggggaa 
tttaaaaa 



<210> 4 

<21L> HHfi 

<212> DNA 

<Z13> Homo capieng 



<A00> A 



ceg^gqilyyr;. 
cctcggctgg 
gr;gizE^!0Cir2q 
tattgc<igct 
caggcaggag 
g r; a tzcjq^q a q 
attgtc-aLtc 
cagatctgca 
g L gt rt.«<j n t. r. 
aaacatcggc 
tcttnctaan 
atcgtagggc 
cagntggtga 
c&g > atckc:ht 
gtcngcctga 
tgt*i:atc:t.t 
cgctggtgtg 
gottrjnaagt 
tgtttttaga 
ttcgtcaagg 
a^Aggfcarjut 
ctcagtccat 
Tdgtcactat 
a ct&ag £j a m « 
tccctgtacc 



caacgggcgt 
atctggcccc 
gcctggtgcrg 
tagacacccg 
oocctcgrujo 
LgaLac tg l.c 
ggcacaactc 
togcnattta 
agataccaaa 
agcagtogaa 
taf-gcaaL-rtt 
tggcgttgga 
gtcctnctfci? 
gcttgagagc 

ccgatgqqtT 
cagaggattc 
ttggggaaga 

ggaacagtct 
aagattcttt 

Lg1iL£££l:gL 

cgcttggcac 
ttaagr^agaa 
aaagaagccg 



tccyqtaqqq 
ccctccactc 
ctqccccgcg 
ctt^r.ocg^c 
gacacccagc 
gcaqcaggcc 

cagatgttcc 
g*0*C*otc!g.?O 
ggaaaacaag 
gaagaacaaa 
^r_ac^tgar.a 
aacagctaca 

agtcagattc 

tacatrtgtg 
agctgtatta 
gtt^.gytttg 
cttgatggtt 
cctgntggcici 
Ltaagaacta 
aagacacaga 
as a ci&a n t 1 y 
aa tcagasgL 



<2 10> 5 

<211> ??4 

<212> DtfA 

<2i:*> Unono 



<4O0> 5 



tgi^goi^acnc 
ggaccctgr.^ 
gtcccaccca 
acP.yctggtc 
gcgcctrgtg^ 
tggtagagac 
c^A<-.tctgcc 



caocccaccc 

cccgccgcct 
tgtgaagagc 
gyt^tgcg<;l: 
ccccaggcct 
cocatttccc 



cccBcggcac 
oqqtqqtqcc 
tqyLeLLtCt 

gtccEocctg 
cgnt gtccrc 



r.tgcctgttt 

1 1 1 cc Lggjjq 

agcccgggtt 
gg y nr^t rjgac 
cagcagce<ra 
otttttgtctg 

fc<:l:o«qcct g 

gacttgam:i: 
gtggttgctg 
Cl:cAggg^>cii 
gtacaatagL 
quiijrjatgggc 

tatggcaagc 
m » » i j n ,i t o o t 
agcagatat t 
rtijcjijtTcitg^i 
t&ctgggaga 
tcaat taaaa 

AAAgt:i:nttt 



«gacgctcc 
CCCgay L c:c:c: 
aagatggctg 

ac-cccrtcrcrtt 
gccor^gccag 
rtLgrt t:y naqn 
gagctcsgcg 
cttqc5qaqc 
aacaggcaaa 
attggagctct 
r.AgAgiiL^MM 
ttgaagataa 

hc:«o^tttc^ 

acatatcajgt 
n r?ttrjqoicct 
ccagaoacco 
aacttgttcc 
£i 1. g y i":u! t: .i 
tctatagagc 
nt gt gat gat 
ggga&iriiuiM 
DDatcagacc 
q^rtatcipnt 
tttcatgung 



cfct<2Cf>sc<?gg 

ctcggggaca 
cqtgcc^cgM 
gt$<r:LLggftL 
ccaagactcc 
tqogggcgrxi 



atg<:t.nngtc 
agaatgct. r. I. 
lL:1:cjcagocc 
atgcagagqi? 
qccccaccag 
tggjjt.ar?cgt 
gcccac-a Lyy 
q^t tctccctg 

gaagcagatt 

qt gaagtgga 
«iMt.qt.-it.At t 

gtctttctg^; 

gta gca ct gt: 
l.rthqr^tTCtt 
agaaaatAAc? 
ctacttgcat 



ggtgctcgtA 

<:gqr?3qco<3C 

ccgtacgcreg 
tjctangctcc 

tzgggigyijnijq 

catggcagaa 
itfotgnangt 
gatgLLL-yVirf 
^igcaagaggc 
aga^ga3ts»»i3 
ctatgaagcc 
■jptbttcttat 
tgatg^tgac 
qqa^tctgtg 
gctgccaALg 
aagntcattg 

gtacacgt cc 
i ot cciTtaaot 

gttgacaacc 

y <jrt<: t q-d t rjO 

caaagtattt 
OLcccgatatt 



nttgaagggt 

1.»ijtfOqtsqq 

ccagagcLgc 
ggaactgggD 
1. h ljos te>t 1. 1. 
tcaggc^jgcg 
ggcagccacc 



aggagaynrjit 
gqaacactct 

tt Ltjhciitot 

□ggacac^Lt 

ny t Cr-t QCCt 

ggcragtggA^ 
Gtr-actattg 
Lgyq.Tbrrgng 
catggctttg 
^ttattc<aga 
gaggac^LLy 
qagttcctDt 

glciMyrttoCt 

tttctaatgt 
i:i:tqr?t^tCt 
gaatccctgt 
n<rt q t c-acct 
ttccatcaga 
ctagt cattc 
gAl;l:l:L1: 



<:«jat.i:r:t:<:<:g 
cgccacccca 

get g Da gaga 
tqcsqarjrrr?^ 

gcr.g^gaag 
ectgetgaaa 
r^tt£?t:br^CTi^ 
tctctccaga 
qrtoqa q no rj 
atcaagtttt 
acccggtctc 
a g l: m aaa L gg 
Dttaaactgt 
giiy y LccLyu 
gagcagttgt 
cqt Ctrjfjwt 
t.ggagaga;ga 
accacatat a 

Cj|-.<JCJMM hAl-.l'. 

cctgaagagc 
caattctact 
l:g<tl.gtaAta 
ttcgtcgggg 



cctgccggtg 
t qt to^jq t. 
i^g,?jc:ttggag 
tgggcctcgt 
yttg^at tgc 
tgtgggtccc 
caagcot-rict 



7na 
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1560 
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1920 
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yrj^tqcggCC 

cccgcacagc 
ttrxgtgttt.g 
at ttaeLgiia 
aaaaatgctg 



occtctcggc 

caccrcgtggg 
tctgcgtttg 

ctctcagcct 



caggcattgg 
cct goncagg 

gtgi:i.gi:i;i^t. 

tttttcaaga 
t^cacqtctg 
cccacaatL^ 



ty t-qcccggc 
gnccccgttt 

AAl^lirTC-OtCjt 



tgagtgggyy 
mir^ccacac 
gC'C'Q994:Jii:iL 

^ttgctgctt 
cai.ttttaca 



gl.c7yl:gqy<:c 
ccagcggcc-c 
nactccatqt 
ggattttgaa 
ccaatttggt 

d dH a 
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<23U> 6 

<211> 2936 

<212> DHA 

<2l3> lLc*mo sapiens 



<40C^ 
cgtaaccrctt 
tgtggaggtg 

qrqqq 1 1 *3CO 

ttatgaccta 
aggagtggga 

ggagacagtg 
ttacgatgtg 

Cdg&ALgawti 

MMMc^nLLf. My 
cccdEiataCd 
agrftaqaann 
cagaAtmga 
atgaatcaga 
at gttqat ct 

attctgaggc 
»<j y rit «<-!«qt: 
tggaaaaaga 
aagctgaacg 

A£i:tgg££ri(. 

ssaggaataa 

<2«-y Lityfbqn 

tacaagctgc 
C t ucjcaqa a 
aatctgaaaa 
ccaggaagga 

»t*3i-3t?C>q^l3 

aqqnaatggt 
cgcaactaga 
at a a get tea 

L^CHycig^qc- 

gactacaaga 

ggt.nttnaat 
gatatacaga 
acctqcntcta 
ccatttecl.o 
agactagaag 

ttgactiN.LL 
gecteaagaa 
tncaagtgaa 

attttttttt 
agatgettgg 
nt}t bqttciciaa 
aaaggctacn 



agtcccaatg 
tccccggtgg 
Drrrrciqcgtyo 
crtttagagga 
aatacccttg 
l.ii»y cirt^A 1:1: 
gattgacagg 
agcatcataa 
atgtaaagt L 
aaDttcagct 

H a a tGjAA 

acaagaatta 
qta 1 3 q 3 cr<;t. 
Acaceagaag 
gattgeaaga 
cacannaTcic 
atLaaaaggt 
tcaggtggaa 

c:,?q»t:r.i-Cl.g 
gctacaatca 
agaa a tsnnt 
AA^agacatc 
gcttcaaagt 
•i C'T>t^ ■ * h u a 
caaagaagaa 

Cttq#ttwyc:a 

ggstcragtat 
gcttcagagt 

ttcacttgea 
ggagagatta 
ao^teffcaaaq 
tgacegtate 

gaaagtagaa 

gaagacataa 
tcaat cctTt 
c tttatatgea 
aactggaaac 
cttggaqaac 

aaggtaccta 
gcattaanng 

ttggcatttt 
ccataacagg 

Tiqistq&Ctfflt 

nttf.cLaaaa 



crtc.tf:gtaag 
cgagcgcggc 

^3^^To^yt.cit.^T^T 

aggtataatt 
ganaganaac 

tdiUiyal.a I.y 
ttctcagtcg 
agctantt at 

OTtgct tgan 
acLy£dgd£d 
gaaactQcaa 
tj hfl 1 9 1 t. n 
gaagagtatg 
ctggaggaag 

ttagaggctg 
aatgoccaaa 
££ggctgaaa 
agcagtgaac 
* cattqtcc. 1 * 
aaactggaga 
gaaotgqatq 
gtgetcttag 
ggttatcaa*> 
gAtLLggaga 
gaagagasst 
gttagqttac* 

£ft£A£Lt£Lg 

cagtcagaga 

acattggaag 
acagacaqoq 
cagaaaaaga 
gtcttggagg 
tttuzi: hl:£«a 
tgaatttcga 

ggaggaacaa 

atttagatac 
etacatgetg 

a a at a gt t a g 
gagtgaaata 

gtCCCV*er:wr: 

t ha Aaagt c;a 
tctgtgtgca 



f:gg^^Ll.q^ 
cagaactg££j 
totaqqgagg 
tggct Litl.g 
taaaacagta 
q,fi tin <: httt-cs 
gagttc^a^ 
cagacactga 

gaactaagag 
t trf t:l.c:c:r-oi 
tgagagaacg 

h y r;t t rjiq ct a 

cacgtattl: L 
ataaagaaga 
b.iji]»tiCT?ct 
aa gtagcgga 
qant acaggt 

aa C b d L L i : 

aaaatacctt 

cagesagage 
qatt acaqto 
tagaaaagga 
aacttgtqgt 
aaatgaaagL 
tgegggstte 
c?a^ttcaqca 

atgacttget 

agaaacagat: 
ci^qaaaaqt a 
gaaa^LcLcL 
caaagaaaga 
Odt: l:/ihfl <zfi a 
agtctaattt 

I. li^j t CO qnco 

catcaaaggg 
nnacaactag 

tttttac'tgu 
tattgtatga 

wyHtjfltflttt 

agtcaagt r.t 
taactatcat 

gctf.a a a ga a 
cataattaaa 



tnrjgtgc.ctg 
Cj1.i:/j<:l*.l'.i3m] 
acacctctgg 
ay afctct-qnc; 
agaaagecaa 
c^enntaatt t 
« m m t y, t !: /^i h 
aggctgaaca 
bqbttocit-q^ 
gagaattagt 

tt t taggaa L 
tgaacntoca 

acta-egtaac 
tqctcgngao. 
a LLad^gycl 
gcggcagttg 
K h K:ii:y y 
tttaattaac 
aijaocttaan 
taagagLgag 
agacaatgaa 

Pt tcicaagat 
gcj a a ^; a l.ga L 
scagatggca 
acnncittgtq 
y eaaa l.caqL 
gaattcaaac 

acagtggttg 
oaat caaqct 
Lr:at^jdaai: 
agaattggaa 

tggttcctaa 

aactctctct 
aggaac^ttgg 

ttqc-caaaacr 
ctgtcaggat 

taaaamttt 
taattctctc 
tt-.^vt o t d t ^ 
r.gnadaggta 
gagcttaatt 



tgqttgqagc 
t.t.tt.<:<:gl.ql. 
a 1 1 gacjga t r- 

aai:ttatttt 
tcctcctqqt 

cacaaggttg 

agagaaaact 
1 1 qci.ifl^y 
cLdgatgaag 

ttt ctcciont- 
.m ,ir^ fj t a a aat 
cagctgetta 
aaaqtr.-hrttt 
i^/i£3aaggaga 
gctgacrntr^fj 

y£:.*jyadcgct 
ddittqeata 

c-tagaaagag 
attctcanaij 
a L&cgtaaag 
g t\ a^ri q 1 1 » 13 
gtctggaggc 
gaagagntcn 
act**i:igflfid 
agtttgcaga 
c-aaatqi^ty w 
gi:lgaaaaag 
gaagaaanqc- 
aaqrjrtqdadC- 
aaattgaaaa 

rtrtq Hi c l a a a a 
catgcot cm 

C^b.gyataCLa 

acttcqcaaa 
atcttci;qq» 

acttgaatgt 
tttaoq^ttn 

actttgttgn 
ttcat<::ttb.y 
c^aaaccgt 
tctatatacn 
tttaa^J 
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<4 UU 
naaggctaca 
L'Cjy artcjfx^H* 
agcaaacotg 
ggaattattt 
g l;^ h a a oatc 
attagectat. 
□ tctcaaagg 
C: Lrt exit tact 
acaacscctq 
tatctqeaaat 
t C't a^^:aa L 
acagccactg 
tcccattggt 
gagtggAAcl-. 
tc&caggttt 
Lggt.rsttciit- 
tgatgggcga 
tagatcaatt 
tf tytqqqcat 

ggacagttat 
gtatqgacta 
trtctgligga 
gaactttcct 

IqfciTCCa 



> 7 

tC'atf.AA<^: 

fcatgaaatgg 
QrtiHtnxscta 
ccccgcfca he 
tgggtgaata 
Avf*qqctqt t 
ctgcgcai:l:a 
cttctggatg 

gt £4 Lkg.^q 

agtagtaaaa 

acacagtAU. 
atgaaggggc 
q a cuj n at g 
aatgggtttc 
qaagatctta 

gttgtggatt 
ataagtgtcc 
a Lg<:«< : I . v. a y 
agactgaata 

l: K h /1 ottC 

caggacaacc 



tngaaggagg 
t CCrczfi grt.^c C" 
gaccagttaL 
ttctcttcct 
tgggwnHtqt 
ttagtcgaat 
Hrtqaggaaga 
atgaggaLiiw 
ttcgcaacaa 
t a I: ttg."ft 1: a 
cat get teat 
ttcit-ctcagg 

ttgccccatt 

at eg a gt cca 
ofcoqt ngctgc 
t gt L c c a L y.Lj 
gatttgacrcc 
acttaa en g t 
tggaLgafiaa 

c-aa4ft-£.gl:jji: 



<2io> e 

<:212> DMA 

<213> Homo a a pi ens 



cfcactgcrgtc 
oggctcgatc 

Lgtjt:kgti?pg 

catctgtcac 

taattgaagg 
ctqgtaaaac. 
gtaitaliLl.i: 
caattaaaaa 
a t£& 1 y n/Mj.i 
aatcaaatat 



> B 

getggagege 

cg«q b rj: 

gctgccrcgec 
flataagaccn 
accaggagag 
tqacattgac 

t L y £ I: h. kq n a 
atacacaatg 
tnrj^qctqtt 

tgggcttcct 
ygtccttgao 
aagaat teat 
qtgaatacag 



ggcggaggtg 

gecagcatgt 
cagcttgcca 
ecLcaggLct 
tcccagoaag 

ggaacaggat 
nnncaccttt 
get cca gatt 
ttntttgata 
gaagattcaa 
aaccnagag 
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<4D0> 
tgqaaattat 
aagsirtA.-^^n 
gaaaagggtg 
q^qtgatcat 
tgaaayt-t^t 
ggcaaacatt 
ccagoaaaga 
tu**»attcgg 
atctLat:t.c« 
catgaacaaa 
aaatcagtct 



agacctsgca 
rrt-stggetgg 
tttr:gg£A*iy 
gaggatgaag 
qntgaatc«ag 
aczttgLgaaa 
ctgcagactc 
gatactcagc 
qacu3ctaaang 

cagtatgaaa 
■jttcgect nn 



aaaaaagatt 
taaagaggat 

aagaggagga 
nttctgaatc 
ttgcaattaa 
tgaaaaagca 
ttgaaagaga 
caaacuia^qt 
ga i^ttcaag^ 
agenattgaa 
tgaaaa 



acga I: t.<iaaa 
tgtgtggaga 
"■■Mwnnncagc 
ga ga c a y r: 3 ^ 
accttctccg 
g<r:flqr>crafc c 
tatgcgctl.q 
t a aat t ggaa 
a ga tat g;s y K 
cnnggttccc 
gadc:Lc:MJic^c 
gagacatet r_ 
*ib qcstntggt 
aa age 11. 1 njrr 
ctcwangaa 
tgci4s»»gi;c-? 
agaggc<:tgg 
cjwqctaaga 
tt LCAAhhf.t. 
gattaagtta 

tctagcagaa 
rcq\iacttctc 



agccggccgc 
rjcgcctcagc 

tgy9^9<:<:t-:<7 

aagagaagat 

L£c:h ."igogfl ^ 

agcagatcta 

ttgatgttaa 
tt nagagtat 
ttaaagtya^ 
ccactcgtga 



tagagaaggl. 
aatacagaca 

LqHtt t.t« TCi ci 
ggaggaggag 
agntgaoaaa 

«?jiji:t-r3 

gtatgaagag 
ecaggtgett 
taggtctgnn 

gaaattgcag 



cga a c ti^r:^ c?. 
q nzi Ot 1 1 a t C 
aaq»r??iqncQ 

cctgccactn 
«rttqcacctt 
aaggaaal.hc: 
tggctataca 
t d L L l:y /d .-j .1 o 
tqgoctgagg 

dCdtj^MM? 1 jrj 

at ccagtgtg 
i.^tqiinctca 

gatLLbqhqr? 

tqgaccatag 
Ct LcL^gc:Kt 
tatgtggaac 
y n n CCClt c 
tctcaagi:** 
t tgtctttgc 

^aliyijtODtci 

ttaaaaaaac 
g t. n 1 : a x.q q 1 1 
agtgaagLgg 



r:i^yi:i:rii]c-»il 
tctgcggtgt 
qqnegngnge 
tgaaggaL^c 
sqat nnggct 
cattcaaLgt 
tgcctatgga 

tgaagaaaaa 

tattgatgea 
ttt.citoctqt 



ctgaccaaga 
qtqqaaqaaa 
gaaigaiL^tf<:« 
gecnattate 
attqotqaac 
aagfrL£Al.3* 
caaaaettag 
tatgaaaaga 

caggal:gLga 



agctgattca 

a<:t.yqt^tqg 

tcccagagct 
qryaoacagca 
taaagttijygK 
ocqaatatct 
ac ag t. q » r^o a 
tccaggatga 
agttl:gt.r?ttt 
gagecacagg 

Lhrt-qhzuiociC 

acagga-caaa 
aggaactttg 

CdMJtiJi I. « I .q 1 : 

ttctcttgga 

taagaagaaa 
qcfci f?ac 
cacLaccaat 
cccctgtacg 
a lj<: l.g m y t. y « 
ccaacataaa 

y;d h hl.KI J. ijii 



acgccccogc 

Cry it C Cz<i A 

tccgtgcggg 
1:3 t-rtt". t.+.gtrt 
tttscttttg 
m t. h yrtii »hh[: 

caaactggag 
qa n J^aa c t: qrr 
aaacacatag 
ttagagct ot 
aaaagtaaaa 
gggccjttt cc 



< jq n r u n q a crc? 
gaaga<iagaa 
qtcaaqaagt 

hhqal^qyyyy 

aagcagactt 

geteggtaga 
aactccaagc 
gqttqottojLrt 
Lgqaaatgaa 
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120 
Z»D 
240 

360 

A?.0 
4S0 

600 

SCO 

"?eo 

134 □ 
900 
9hl] 

1020 

ll-SO 
1200 
1260 
1320 
i:iHt) 
1387 
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300 
420 
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icn 

300 

420 
4B0 
540 
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gttcttctgt 

ttL1.fi I. t.qnt 

tgstgatatfa 
notqtcccact 
tgtg^tf ti:*,h 
tgttcccaas 
^ril:i ? <:i-ttt-c 
ctatatgccd 
ttttcacagt 

acgtaaataa 
qc^qcraactg 
accatggaaa 



t caaca La Lg 
cctcaagaag 
t^r-patcagt 
tcrtgaal. L.it: 
ttgatccaaa 

tgtgacaaag 

<:^tqactCDt 
ttttCatLLL 
ggtggtaata 
r.a * <^ tqtq 
aaaccagcag 
nactttagta 



agcagccc-c^: 
hqc>gctgcct 
ga Lyy Q^gud 
ttcacataay 
tgtttqactg 
atgaai:httt 
c-n.ctaccttc 
ac-ay ij.inio.tt 
caagtcri^a^ 
atqnnttgaa 
aaa Lgy y nqrj 
gcagtgaaag 
I.Lhq.^tccan 
ggaaccta^L 



ycccgggcALj 
cArttqt tgga 
gttagtL-.q* n 
rrngatttgaa 
ggyciflccaca 
tgctcctg<:y 
LanrciTaagct 
gatgacflL-rrrt 
rt.^^taaaagt 
gaaL^Liiacn 
gggaggattc 
aAL^-ftgcag 
gaaaata^.^ 
Lijqrrttttct 



qaatagaaga 
eL».?ttngga 
gctatta&MM 

aat Ly<::.>c!£>q 

^qtcttttgc 
atctacMgt-t-c 
qtgcagaatc 
LL»tj;^nt ta 
aataactLiy 
■jqaQttcrtat 
caatgol: hq« 
qtaatgggca 

caaytyqotq 



l-qaacnaacc 

agcLgl:i:^rxa 

ciaccatctgg 

a^nr:tc?gaaa 

ttggtga !.<■:*: 

Li:cracratgc 

agcaaaadittt 

l-.hq&acnaag 

gt ga La c m i irj 

.Ttgaacgace 

atrfd,-j <jyq ct 

cattactact 

aMHigtgqtca 

gga 
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<400 

t aaaa» 
tacatgccaa 
tftttqagact 
ccttgLaa^c: 
t ctaggagat 

caagcttcag 
^qtntgaga 
ggtcai^ttq 
ggcaggcaga 
tiyhctntsot 
tgcttggga^ 
trjflqatcgtg 
aaataaa Laa 



> 11 
^ccttngatg 
ag t L 1: a a a 
gttgaaaggg 
«ttqanrrnng 
gagaccfcec-i: 
iTcataggaact 

a l:tjrttc*^CC.a 

atctatctgt 
crcitggatgtg 

aaaaatacaa 

ccactgcact 

rttqqq"tcnc 



atacctgcca 
flcttttgggca 
cctgac<:l:rtrt 
tatgtgagag 

t Kciriflq-t.t.^t 

gaattctgcu 
cngpctgggc 
t c t gL y i >t. y i] 
gtggctcrata 
tcaagagttc 
aaattagac:^ 
qaqactagct 
C^agtcLggg 



utwatr^atat 
gatcrcagl: t:^ 
tcc-aagtgaa 
ggc<:Bi:3rrq 
cagcaagcag 
art«,>^tiitga 
tgaca^^l.qq 
ncttctaata 

aagaccggc<: 
Mycrjt^gtgg 
gg a a ccra^g 
Kq^cagagtg 



CJtCCCaLLrt L 

aqgttcctta 

C<T;<:;<:|-.l:qt;nn 

get aaaa ecu 
y m ,i <i c*aa n n 
gtcagcLLa^ 
ntttagcttt 

t a t £i yuActq 

ccagcaettt 
Kqqc.cn neat 
tggacacc-Lg 
^qqt^qnggt 
aqactccalL- 



ti>qnfctatgt 
gaagaattct 

drtrtyyh-qqcc 

ttgggacctt 

g ca t g a L^^: L. 
tgagataatg 

gqqagrrr.icqa 

ggtgaaaccc 

tgcagtgagc 
n toi ->.T>t n □ nt 



<?!]0> 12 
<211> $12 
c212> l a XT 



Met 


Ala 


Gly Gly 


Hi.-s 
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AlA 




Aln 


Gly 


Ser 
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Leu 
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Val 


Gly 


Gly 


Thx- 




Phe 


Ser 


Thr 


Ser 






20 










25 








3D 






Axg 


R].n 


Tbr Lexi 


Met 


Trp 


11& 


Pro Asp 


Cat 


P^ic 


Phe 


Ser 


Ser 


Leu 


Leu 
































Sor 


Cly Arg lie 


Ser 


Thr 


Le-u 


Ary 


Ar;p 


Clw 


Thr 


tily Ala 


lie 


PIlM 


"Uv 




so 
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Arg 




Ala 


Aln 


?he 


Ala 


rrt> 


He 


Lftu 


^.s n 


Plus 


Leu 


Arg 


Thr 










7U 
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LyA 


Glu 


Leu. Aap 


Leu 


Arg 


Gly 


VBl 


ser 


IllS 


A&n 


Val 


Leu 


Arg 


His 


Glu 
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90 








95 




Ala 


Glu 


Phe Tyr 




lift 


Tlir 


Pi-y. 




Val 


Arg 


Arg 


Lad 


F.I-MJ 


liCU 


Cyc 






100 










1 US 






110 




Glu 




Leu CI a Arg 


Ber 


3 X 


Cye 


Gly 




Veil 


Leu 


Phe 


his 


Ely 


Tyr 






lib 








120 










1 25 






Pro 


l a ro Fro 


Ely 




Pro 






Lyg 


He 


Aan 


ASfl 


TJjr 




ftry 




i:^0 








135 








140 










Ala 


AAp S«r 


7i/;d 




Gly 


Leu 


Ar^n 


Ser 


Thi- 


Glu 


Gly 


Cly 


Ala 


Arq 


] 45 








li>0 
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^70 
^80 
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7B0 
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Gly Asn G.I.y Thr Gin l^o- Val Leu? Srrr Cly Tltn- GIy Glu Glu Thr Val 

1C5 l?0 r?S 

Arrj Leu Gly The Pro V^.L Asp Pro Arg Ly<* Val Leu I la Vm.1 Ala lily 

1B0 DP5 190 

Kis His ASA Tr> I Art Val Ma Ala Tyr Al« His L^hfc Ala Tyr Ara He 

195 *DD 205 

Lye Glu s<-r J?r- r Cly Trp Gin Gin Val Phe Thr Pm Tyr Leu Asp 

210 215 ?£0 

Trp Thr He Glu Arg Val Ala L*u A-;n Ala Jbys Vd.l V.*J Cly Gly Ptu 
225 230 ;>a5 240 

nits Gly Asp Lys Asp Lys HeL Vul Alci Val Ala H»r Clu Ser yer IIr 

245 2SO 355 

I1& Leu Tip 5*i* VdA Gin a.'.;j? Cly Cly Ser G l.y Scr Glu. lie Gly v»1 

260 265 210 

Fhe G*r Tyou Gly Val Pro Val Asp Al* ?.n U Phe I'he lie G.ly Asn Gin 
27* 230 2BG 

Vnl Ala Thr £*r Hie Tlir Gly T.y^ Val Gly Val Trp. Acn Ala val 

2 $5 300 
Thr Gin Kis Trp Gl/i Val Gin rt 5 p Vnl Val Pro Ilr* Thr .^r Tyr Asp 
305> 310 ?15 320 

Thr Ala Ely Ji*-r Ph^ T.^v Leu Leu Cly Cy& A.™ A^n Cly Ser lie Tyr 

325 330 335 

Tyr lift A^p Mot Gin Lys Phe L*r{> Leu Ax-y Mr-t- Lys Asp Asn AAp L*:u 

3a " 34 5 350 

L«u Val Thr Glu Leu Tyr Mis Asp Pro .s»r Asn Asp Ales lie The- AJy 

3G0 .}fi5 
Leu 5er Val Tyr Leu Thr Pro Lys Thr -Sex Val Ser Gly a-sji Trp He 

370 375 360 

Glu II* Alrt Tyr C]y Thr Serr Ser Gly Ala VaI Arn Val He Val Gin 
3B5 39U 395 400 

ills Pj^> Glu Thr Val Gly Ser Cly Pro Eir> Lku Phe Cln •J'hr Phe Thr 

405 410 0 15 

Val »j..rs Arg Ser l J ro Val Thr Lys He Mel: l.nn 5cr Clu Lys His I^eu 

430 4 25 *:it> 

Val Ser Val Cys Ala Asp Asn Asn ill?; Vh* Arq Thr Trp Thr Val TJjx- 

435 440 
Arg tfhe Gly UsL II* Ser Thr Gin Pry Cly -Ser Thr rro *Leu Ala S«r 

4 50 455 4fct> 

Phe Lys Ilfc Le^j Jimj.- l.eu Glu GAw The Cln Ser Hi a Gly 5tr Tyr f?«r 

-no a75 >iao 

Ser Gly A-:ji Acp 1 1« Cly Pro Phe Cly Glu Arg Asp Asp Gin Gin Val 

Ofihi 490 4£l5 

PJ^ Tl^ Gin by a Val Val t>ro He Thr Asr> LyA T^n phs Val Arq L&u 

^□O 505 S10 

s«r Ser Thr Gly Lys Axcj He Cys Glu lie Gin A] a Val Asp Cys Thr 

aiS 5^0 52b 

Tfrr He Ser Ser Ph& Thr Val Ar<| Glu Cyw Glu Cly Bex Ser Arg Wet 

±30 535 i>40 

Cly Her Arg Peg Arr? Rt.cj Tyr l^i v Pjic Thr Cly Hi a "J'hr Asn Gly Sftx 
5*-5 SS0 555 560 

lie Gin Met Trp Asp Leu Thr Thr Ala Met Aep Met Val A.^ji l.y-; Scr 

SGfi 570 570 

Glu. fi 5 p Lys Asp Val Cly Gly Pro Thr Glu Glu Gl.u ].i?n Ldu Lys Leu 

5B0 595 

L«u Asp Gl?i Cyc Asp Lei? ni-r Thr- Snr Arg Cys ^la Thr Pro Asn He 

5&5 6GO G05 

S*x- Pro Ala Thr Sex Val Val Cln His Ser. His Leu A_ixj Glu Scsr Asn 

tlO «15 620 

Sex Ser Leu Gin Lftu Gin Hie His Aep Thr Thr Hi..-.; Civ Aln Ala "Thr 

630 62= 6*0 

Tyx Gly Si-c Wo-t Ary Prxi Tyr Ar.-q GXu Ser Pro Leu Lou Ala Arg Ala 

fiflS £hn €55 

ATrg Arg 'J'hr Glu aer Phe his Ber Tyr Arg Asp Ph& GIji Thr Tlo At:n 

C60 C65 670 

Leu Asn Arcj A^n v^l Glu Ar-q A3Lm VjiI prp Clu Asn Gly Asa Leu Gly 

67S 660 055 

Pro Jlc Gin Aln Glu Val Lys Gly Ala Thr Gly Glu Cy£ Asn 31ft Ser 
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Gly Gly 
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Gly 


Gly 
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Tyr 


.Leu ai* 


Sur 


Ser 


I'xo 




The 


Ser 
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77& 
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Thr Aln 


Sex 
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Sex 


P/.-i > 
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Ly.^ Scr 


Asp 


Ser 
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Cly 


Cln 


Glu 
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167 
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7QO 

S^r Leu Ar^ Glu Leu *sp 
IIS 72D 
Cly The flr-r Clu Sex* Ly* 
735 

Lye lit- Clu L*he Ary J.yo 

75Q 

Gly Arq Lys Ly* v.?l Pro 
7G5 

Gly Cly Thr Asp Sor Pro 
7SO 

Tiir Pro Sex Ptn> Arq His 
7?D hc>0 



<213> Komo aapieyjfc 
C400> 13 

Leu AJ.» Sor Ala Arg lie Px« Gly Arq ± a ro Tyr V«l Cly Val Le\j Aiy 

1 5 10 15 

Pa-i? Pro Gly Ser Leu Phi* Ph* Ala I'xo Ala Ely Thr Thr lie fi c *r Aid 

20 25 ? o 

Lou Leu Gly Phe Cy_^ Tyr *Jet Ala Ala Phe I-mj Lys Met Sex Vnl Ror 

35 40 45 

Val Asn PUe Phy Arg Pro L>he Thr Ary P«o Leu Val Pro Pho Thr Leu 

50 55 en 

His Axtf Lys Arg Asn Asn Leu Thr 11* l.r^n Gin Arg Tyi- Mc-t Ser 
G5 70 v±> BU 

Lys 13 « Pro Ala Val Thx: Tyr Pr« T.y? Asn Glu Ser Thr An Pro Sax 

55 t(0 <?5i 

Glu Glu Leu Glu Leu A«p T.y* Trp Lys Thr Thx MtH: T.y.^ £*r Her Val 

100 l<)5 ]10 

Gin Glu Glu Cys Vrtl .Ser Thr lie Ser Str Ger l.y 3 Asp Glu Aep Pr^j 

U5 120 125 

Leu Ala Ala Thr- Ax-q Cl\i Phe lie Glu Met Txp Arn Le=u Lew Glv Area 

l^n 13b 140 

Glu Val Pro Gly Hie lie Thr Glu Glu Glu L*-u T. V p Thr Leu Met Glu 

155 ISO 

Cys val ni?r A^n Thx. Ala Lys 

<2in> "i4 
<211> 
<312> PRT 

<2l3> Homo sapiens 
<400> .14 

rro eye Glu Gly Gly Ala Arg Ser Cya Leu Val Thr Glu Ser Ala Arq 

1 5 10 15 

Gly GJ.y l.cu Gin Phe Leu Gin tin Cys Asp Attj Glu Asp Leu Val Glu 

2« 25 3D 

Tjpu Ala Leu rro Glix Leu Ala fiin v?il Vnl TKr Val Tyx Glu Pint T-ito 

35 40 45 

Leu wet Ly* V0I Glu Lys Asp J3i9 Leu Ala Lya Pxi> P*k% Phe L^ro Ala 

50 55 CO 

Z±* Tyr Lya Glu L*he Glu Glu Leu Ms Lys Vnl Lys Lys Wet Cys 

65 70 BO 

Gin Asp Tyr Leu Srv Sur S<?c Gly Leu Cys i;er Gin Glu Thr l/z V ci\K 

f5h 90 
lie A*c3i A^n Asp Lya Val Ala Glu Ser Leu Gly T.lo Thr Glu The l^eu 

lOO 105 no 

Arq Ly* Lys Glu lie 111* Pro Asp R&vi I.t^ Cly Pro Lya= Mis Leu Sex 
115 120 125 



50 
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Arg Asp Met Asp Gly Glu Cln Leu Glu Gly m.« Ser Glu Lye Am 

130 H0 y ? 

Glu Arci Glu Ala Ala Cln tilu Gly Leu AJ* Ser v*.l Lys Arg Pro Arn 
145 150 155 161] 

5 J^rg Glu Ala J-™ Sar Aan Thr Thr Glu ser Lpw A i a Al* Asu ^er 

16s no us 

Arg GLy Arq Glu Lye Pro Arrr pro Leu His Ala Lou L'jto Al/j civ L'he 

1S5 190 
Ser Pry Pro Val Aan Val Thr Vai ^ r Pro A^q Ser Glu Glu Ser hie 
10 200 205 

Thr Thr Thr Val Gly Gly Aaji dy Ser V.tI L>he Glji Ala Gly Pro 

21. s 2S0 
C^ln L«u Gin AIA L<n* Ala Aan Lkl? Clu Ala Ai- n Arc Gly fii;r lie Glv 
22 * £3 » 23* 24 0 

Ala Ala u™ Ser Ser Arg Asp Vnl Ser Gly Leu Pro Val Tyr Ala Gin 

2*5 250 255 

ser 61 y Clu L>rt> Arg Ar.y Leu Thr Gin Ala Glr> v*l Ala Ala Pin- p ro 

2G0 265 270 

C7Jy Glu Aan Ala L«u Glu His s&i.- Scr Asp Gin Asp Thr Trp Am S*r 
20 275 280 2$S 

Leu Arg Ser Pro Gly Phe Cye Rnr Pro Leu s vr Ser Gly Gly civ Ala 

2^0 295 :mn 

Glu Ser: Lgu Pro I a ro Gly GJ.y Pro Gly 111-; Ala Glu JUa Gly HLs Leu 
305 310 31* 320 

25 Gly T,y 5 Val Cye Asp Ph* Wis Leu Asn Hi.? Gin Gin Pro Ser Pro Thr 

325 330 33.5 

Scr Val Leu Pro Thr Gl.u Val Ala Al* Pro Tro Lmj Clu Lys lie Leu 

340 345 350 

S£er val aap Gr, r Val Ala Val An P Cys Ala Tyr Arg Thr Val p r .u i. v « 
30 35D 360 365 

Pro GJ.y pro Gin Fro Gly Pto His Gly Ser l-cu. Leu TJi- GAu m v Cvs 

3?0 375 ^0 

L*u Arg Ser Leu Ser Gly Asp Leu Asin Arq Phe L^ro r y .-; Gly Met GLu 
385 390 39f> ann 

J> Val His Ser Gly Girt Az-q Glu Leu Giu Si^r Val Val Ala v.tI Cly GLu 

405 410 415 

Ala Wet Ala Lt>u Lya Phe L J ro Wet Gly Ala Met S«r Tyr Cya Leu Arg 

420 42b dim 

Asp Arg Ser Arg Phe Leu l J he Arg L«n Pro Wet Gly T.ov .^r Cye L>ro 
40 435 440 4^5 

Leu Gin Val Gl.n 

<2lD> 15 
-35 <211> 321 

<212> PRT 
<213> tiomo Bdpir*n* 



^Oto 15 

Wftt Ala Glu Ala Glu Glu Ar.p Cys Hia- Ser A*p Thr Val Arg Ala A.sp 

1 5 10 

Asp Aap Glu g.Ui A W Cln Ser L>ro Ala Glu Thr Aep L*u Gin Cln 

20 25 30 

ubu Gin Met Phe Arg Ala Gin Txp Met Phe Glu Leu? Ala Pro Gly Val 
53 lb AO 

5«r Ser tier Asn Lfru Glu Asn Arg l a ro Cy?s Arg Ala Ala Arg Gly Rfrr 

50 .S5 eo 

Leu Gin Lys Thx Ser Ala Asp THt- l,y^ Gly Lyi Gin Glu Cln Ala Lya 
* 5 70 75 ran 

60 Glu Glu Lya Ala Arg Glu I.imj Php Leu Lys Al^ V^l Clu Glu Glu Gin 

65 90 95 

Asn Gly Ala Leu Tyr Glu Aln lie Lya Pht? Tyr Arg Arg Ala Met Gin 

100 105 110 

Leu Val Pro Asp He Glu Phe Lys LJ n Thr Tyr The' Az-q .?cr Pro Asp 
" 11*> 120 125 

Gly Asp Gly val Gly A.^ Sor Tyr II Glu Asp Asn Asp Aep Asp Ser 
13H 135 14Q 
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l.y* M t Ala Asp Leu Leu R cr Tyr i'h* Gl n Cln Gl.n Leu Thi- the Gift 

Glu ser Vul L u Lya Uat Cys Gl/i Prr, Clu Leu Clu Ser s cr Gin 

His lis Sor Val Leu P r r? Met Glu V.?l. Leu Mel. Tyx I2« PK* Arg t™ 

iGO iss l0tl 

Val u«l Ser Asp Leu Aep Lm.j Arg Sex Leu Clu Girt I.cu ser Lin, 

l»H 200 

W«l Cys Axg Ely |»h« Tyr 31* Cya Ala Ar-c> A*p Pro Glu He Trp Arq 

21J> 

Leu ALa Cy A 7. cu Lys v*l Trp (Sly Arg !3< !r Cys lie T. V s Leu v*l Pro 
^ s 230 

Tyr TIj* rjrsr Tip Arg Gl.« Met Phe L*u Gin Arg F,.-r> Arg Val rtro phe 

ft-^ Cly Val Ty* Tip Ser Lys T1,v Thr Tyr ft r <, Gin GJy Clu ulr, 

2tO 2f,s 270 

Ser Leu Azp Gly the Tyr Arr, Ala Trp Mis Rln Val Gl.j T V r Tyr a™ 

275 2 HO 2B5 

* ryr At "* Phe Ph& p *» *»P <5iy Hie v*l Wot Met L*i> Thr Thr Prr> 

2^0 7100 

Glu Pro Gin Ser- ll Q VaL Pro Arg T.^u Arg Thr Arq Glu. Tyr- Gl n 

Si0 :«5 32» 



<210> lfi 

<2ni> L72 

<212> PUT 

<213> Homo s h plena 

o&00> lfi 

Aid Cys Pro 3'kr His Pji> Pro l J ro l a ro Thr Ala Pro Arg Gly Hi -s On 
1-5 ]0 15 

Cvs Arg i^eu Gly Asp s*r Trp Ser Val Ly* Sor Arg Trp Cy-. pro 
^0 36 30 

C V P Pro Gin Glu 1.™ Cly Trp Ala Set Cy* Al* Arg Gly Val Cy* Ala 

35 40 45 

Trp yer Phe r:y* Ala Trp lie c y * rin Pho He Ala i^u r-eu Val Glu 

™ -W fiO 

Thr Pro Arg Pro Val His Pro Al* Ly* Thr 1'ro Gin Ala J\] * Cyg Glv 
? S 70 75 ' no 

£f*r Aj:<j Thr I, &u i=> r o Pro Ph* Pro *r<3 Cys L>ro Leu Arg ftlo Arq Ala 

OS 9ft 9.5 

Al^i Th r Cln cyi Tt P Lou Arg l a ro rro Leu fily CIji Ala Leu A2m 

*00 IDS 310 

Gin pro Ala ulu Trp Gly VoL Val Gly Gin s*x- Pro Ar^i aei Trp Al.^ 

13 121] 1^5 

Pro Ala Gin Al ft Hi3 Arg Ala Arg Pxo Jlis Pro Ala Ala Pro rt-rn Thr 

] ?f> 135 140 

Ala Thr r\r< 3 Gly Val Leu Fro t.ei\ 3 Cya Pro Als Pro fily Thr Asn B&r 
145 150 155 160 

M*l. Phe Gly Val t:y* T jC u Cys Lbu rh& Phe Ly.^ T- v ^ 
ItS 170 

<210> 1? 

^211 > ^72 

<212> PRT 

<213> noma ^iipiens 

<4 00> 17 

Mr-t Axg Glu Ar^ Phc Arg Asn Leu Asp Glu Glu Vwl Glut Lys Tyx Arg 

15 10 15 

Ala VAl Tyr Asn Lys Leu Arg Tyr- Glv His Thz Phe Leu l.y^ Sor Clu 

30 

Ph« Glxi His Gin Ly.r; Glu Clu Tyr Ala Arg lift Asp Glu Gly Lys 

35 <0 4s 

Tl.o Lys Tyr Glu Rc?r Clu He Ala Arg Leu Glu Gin Asp Lys Glu Glu 
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^ 60 
].nu Arg Asa Girt L«u i.^u Aan VaJ. A.^r. u© u Thx T.ys Aap 5a r Lys Arg 

" ?D ™ 00 

VaL Glu G.l.n l.^u Ala Axg GLu z. y3 Val Tyr Oys GIj. Lys Leu Ly« 

0* 90 05 

Ely Lc- U Clu Ala Glu Val A.l ;> Clu Leu Ly* Ala Glu Ly.^ Glu Asr> S^r 

100 105 H u 

Glu Ala (Jin Val Glu Asr. Ala Gin Ax.- S Tie Gin v*l Arrr Cln Leu Ala 

115 120 12Ej 

Glu Met GIji A.lw Thr VaL Arg s*i- i.qu Gly Ala GU> Lys Gin Ala 

130 T35 1*0 

Aen Leu ftrn. Ala Clu Arg Leu G] u Lys Glu Leu Gin Ser aer Sctr Clu 
US l.SO 155 iB<) 

Gin A.-:ti Thr [ J he Leu lie- A^» T.ys Leu His Ly-i ft i a £i u Ar<5 Gu , 3ie 

1 65 170 17b 

Asn Thr Leu Ser Ser Ly* Val Lye Glu r^u Lye Sis Ser A.™ Lys Leu 

190 135 150 

Glu lie Tlir Asp Tie Lys Leu Glu Thr Ala Axg Al« T.yr; S&r Glu L«u 

19& 200 205 

Glu Arcr Glu Arg Asn Lye Leu Gin Ser Glu Leu A*p Gly Leu Gin ,<^r 

210 215 22D 

Asp Aen Glu rl* Le-n Ly.*= Ala Ala Val Glu His His Lys Vai Ltjsn Leu 
?25 230 23b 240 

Val Glu F.y-f Ai>p Ar n Glu Leu He Axy r. V 3 Val Gin Aid A.l..> Lys Glu 

2d* 250 2bh 

Glu fily Tyr Gin Lys Leu Val V*l T.cu Gin Asp Glu Lyw l.m> Clu Leu 

2t5 270 
Gl.u A^n Arg Leu Ala Asp Leu Glu l.y 3 Met Lye Val Glu Hiy Asn VaL 

260 285 
Trp Arg Gin Ser Glu LyA As 3 > Gin Tyr Glu Glu Ly^ i.cs U Axq Ala Sex 

290 2^5 30U 

Gin Met Ala Glu Glu III' Thr Axq Lye Giu L*.> Gin Ser Val Arg L*u 
30* ^10 315 320 

Lya Leu filn Gin G*n Ho Val Thr He Glu A.™ Ala Glu Lye Glu Lys 

325 330 335 

A Ail Glu Agti Ser Asp Leu Lye Gin Gin Tic .Ser Ser Leu Gin rj*- <>]n 

340 34 5 350 

Vrtl. Thr Ser Leu Ala Gin Ser Glu A.*n Leu Leu Aen Ser A.^n G^a 

355 360 305 

Mcit, Leu Lys Glu Met Val Glu Ar-tj Lku T.ys Gin Glu Cye Arg Ann Phc 

370 snn 
Arc? Ser Gin Ala Glu Lye Ala Glu l.r_ u Glu Ala Glu Lys TLx l.r.u GLu 

390 w$ <GD 

Glu Lye Gin IJ o Trp Luu Clu Glu Lys llifi T.y.s l,r-u His Asp Arg 

^OS 410 41S 

Tie Tlir a^p Arg Glu Glu Lys Tyx Asn GIji a^3 Lys Glu Lys Leu Gin 

425 J^O 
Ar<j Ala Ala lie Ala Gin Lys LyA Arg Lys Ser Leu jais Glu Aaji Lyrs 

^35 4^50 445 

Leu Lys Arg Leu GLn Glu T.yrs V^l Clu Val Leu Glu. Al* Lys T-y.7 Glu 

450 <iS5 4G0 

Glu l.cu Clu Thr Glu IU Arg Ser 

<2]0> 16 
<211> dfi^ 
<212> PR7 

<213> Jlom-D 1 sapiens 
<4Q0> LB 

Lys Ala Thr Ser Leu Thr Leu GLu GLy Gly Aiy i.«i> T.y? Arn Thr Pro 

IS 10 LS 

GLtl Leu lie Hie Gly Arg Asp Tyr Glu MkI. V^l Pro GLu Pro Val Trp 

20 2 c j :-it> 

Arg Aid Leu Tyr HA.'r Trp Tyr Gly A La Asn Leu Ala Leu Pro Aa'g Pro 

35 40 45 

Val IJ.« Lyr; Asn Sct Ly^ Thr Ar-p lie Pro Glu Leu GLu Lfcu Phn Pro 



in 
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so Sii to 

Arg Tyr T.^u Leu phe L«u Arg Gin Gin Pro Ala Thr Arcs T]i r Gin Glr> 
* 5 70 75 BO 

5*1- Ash He rrp v»l Asn Met fiJy Asn Val Pro £fer Pin Asn Ala Pr^ 

6& 90 95 

L«u Lys Arg val T.cu Ala Tyr T)>r Cly eye Ph,<? Ser Arc? Wet Gin Thr 

100 ICS " in, 

lie Lya Glu 1^o His Glu Tyr Leu Ser Girt *rq t*u Ait He lvs Glu 

US 120 120 

Glu Asp M** Arg Leu Trp T/ni Tyr Asn Sex- Clu Asn Tyr Leu Thr I.<^ 

13S no 
Lftu A->p Asp Glu AAp His Lys Leu Glu Tyx Leu l.y* He Gin A 0P Glu 
14 J J 55 160 

4>1 n His Leu v«l. Tic Glu Val Arq 7V*n Lys A*** Met Ser Txp Pxo Glu 
25 16i 170 17b 

Glu Met S*r Phe He Ala Asn S^r &er Lys II & Asp Ara His Lvs Val 

ifO 105 i9D 

Pl "° T hr Glu Lys Gly Ala Thr GLy L*u J?€ jr Asn Leu Rly Asn Thr Ty; 
1^5 200 20b 

20 Fh<* Met Asn Sex- S<;r He Gin Cys V*J. Sor Asn Tnr Cln Pro Lt-u Thr 

215 220 
Gin Tyr rb* n.«; Sor GLy Arg LI Is Lc-u Tyr Glu L«u Asn Arg Thr A?n 
^ 2S ^30 235 2J0 

IL * Ely Met Lys Gly Hi-, M 0 t Ala Lys Dy* Tyr Gly as p Leu Val 
24 245 250 2SS 

G.l.-n Glu Leu Trp B«r Gly Thr Gin Lye A^ji Val Ala Pro l.^u Lya Leu 

26D 265 270 

Arg Trp Thr His Ma Lys Tyr Ala Pro Ar^ Phe Asn Gly Phe Gin Gin 

2"?5 290 265 

Gin Asp Sctr Cln Glu Leu Leu AJy Phe Leu Leu A.sp Cly Leu ui'-s Glu 

290 295 300 

Asp Leu Ar, n Arg val Ells Glu r-y 5 Pro Tyr Val Q.li> Leu Lys Asp 

310 315 
Asp Gly Ar^ Fro Asp Trp Glu v.?l Ala Ala Glu Ala Trp Asp Asn 
35 325 335 

Leu Ait? Arg Asn. Arg Cex- 11^ Val Val Asp Leu Pli^ His Cly Gin Leu 

3^0 34_-> ?J50 

Arg 5cx Gin Val Ly.^ Cy? Lys Thr Cys Gly ui« Tlo £fer Val Art? Phi- 

liftO 

Pro The AiJi Phr; Loxi S*r Leu Pro Leu Pr.o Met Asp Sea* Tvr McH- 

37* 3ao 

Ki3 Leu Glu Tl.o Thr Val He Lye Leu A*p Gly Thr Thr " p™ Vjal Rrq 

^5 3P0 3BS 400 

J'yr Gly t*hi Arg Leu Aai^ Msl. A.^p* Lys Tyx Thr Gly Leu Lys Lys 

Gin L W Jer Aep Leu Cy« eiy l.cu A^n Ser Glu GIji T1<j Leu Leu Ala 

<20 42i a;™ 

Glu, V^l Hie Gly Sai- Ann He Lys Asa Phe Pro G.Ln ftsp Asn Pro Lye 

435 445 

Ser Thx Glu T.ou Leu Ser Glu Val Gly Ph* Phe Cly Val Prn 
-950 455 4&0 



,10 



40 



3* 



<5LI3> 19 

55 <212> PR.T 

<l213> JJdotio sapiens 

<400> 19 

Pro Tl.r Trp Leu Glu Ar*r Gly Gly Gly 01 u. Pro Ala Ala Ar- C? p r <? Cln 
GO 1 5 1U IS 

Thj- Pro Gin Pro TWl- Al.w Pro Glut Ser Arg Gly Pn> r?cr Cly Ala Sar 

20 as ^0 

Alci Leu Arg Cy.-? Arg Gly L*xo Thr Ala a*<j s^r Leu Pro AIjj 7\]& .5or 

35 40 45 

65 Wet L*u Gly Ala rro ftsp Glu 5«r v*l Ajg vsl Ala V n l Arg 11 

50 5S 60 

Arg Pro Gin Leu Ala Ly* Clu Ly.? Tl^ Glu Gly Cyp Hig He Cys Thr 
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65 ?0 "?& 80 

Ser Val Th.r Vr<? Cly Glu Pr*> Gin Val rhe Gly L Y * Asp Ly* Tile* 

90 95 

l?h* TJir Phe Asp Tyr Val Phc Asp lie A.% E i £er Gin Cdn Glu Gin lie 

lf>0 105 no 

Tyr Tie Gin Cya lie GLu Lya Leu l±« Glu Gly Cy* Phe Glu Glv Tvr 

H15 T20 125, 

Am Ala Thr v*l Phe Ala Tyr Gly Gi n Thr Gly Aln Gly Lv'j Thr Tvr 

135 IdO " * 

Thr Mel. Gly Thr Gly Phe A*p V Q 1 Asn He V*L Glu Glu Glu Leu Gly 
1^5 150 IS* 16 5 

Il«; He Ser Arg AIa Vn\ Lys= bis Leu Phr? Lys Ser Jle Glu Glu Ly- 

1G5 17& 175 

T,y;5 his lie Alb JIk Lyo Asn Gly Lwu Pro Ala Pro A 5 p L J he Lyft Vn 1 

1B0 18& 

Aan Ala Gin Pho Leu Glu Leu Tyx- A G n Glu Glu V:il Leu Asp T.<*v Phe 

195 200 yci5 

Asp Thr Thr Arg Asp II* A?>p Ala Lya £ier i.y-s Lys Ber A^n He Ar*r 
210 21S £>!0 V 

He His Glu Afsp Svc Thr Gly Gly Asn 1..n> Tyr Cy* Gly Arq l>he Glu 
22* 25ft ?jo 
Kls Val L*u 



<21U> 20 
<211> 223 
<212> PB.T 

<2 1 3> h como le j i s 

Gly Asn Tyr Arg Pre? Ser Ly t ; l.y 5 Arg ['he Arg Glu Cly Lys Glu Lys 

1 5- 10 15 

Lys. Arg Gly Lys Lyrt Val Trp Leu Val Lys Arg l>o Jle bin Thr Ueu 

20 25 :iD 

Thr Lys Arg Ary J.ys Lyc Lys Arg Val Phe Alcj Lys Glu Lys Fro Asn 

35 OH 4^ 

Glu Leu Glu Vnl Glu Glu »er Gin Glu Val Rcr Asp his Glu A*p Glu 

50 fi!> to 

Glu Glu Glu Glu Glu Glu Glu Glu Asp Asp He Asp Gly Gly fil.u Sr-r 

70 " 7b BO 

Ser AAp Glu Ser Asp Ser Glu Ser A^p Gl v Lys Ala Tyr Gin Al*> Asp 

90 95 
Leu Alia Asn He Thr Cys Glu 11^ Ala Il« Lys Gin Ly* i,ku lie Asp 

100 10L 110 

Glu. Leu Glu Aj«i Pi^t: Gin Lys Arq Leu. Gin Thr U:n Lys Lys Gin Tyr 

115 121] 
Glu Glu Lyy Wet Wet Leu Gin his Lye 11* Tir.c? Asp Thr Gin Leu Glu 

130 135 100 

Ary Asp Gin Val Leu Gin Aan Leu Gly Ser Val Glu Ser Tyr £cr Glu 
14£ 150 355 160 

Glu Lya Ala T-yy Lys Vol Arg Ser Glu Tyr Glu l.yr* Lys Leu Gin Ala 

lb* 170 1"?5 

Wet Aisn Lys. Clu Leu Gin Arg Leu Gin Ala Aid Gin Lya £lu His Ala 

ISO 1G5 190 

Arg l.cu Leu Lys Aan Glrs Ser Gin Tyr GXv Lys Leu Lya Lys L&u Glu 

19* 200 205 

Gin Asp Veil Mel. Glu M^t Ly* Lys Thr Lys Val Arg Leu HeL Lyy 
210 ii.s 220 

<2ll)> 21 

<211> aOfi 

<212> 1 J KT 

<:21S> Homo sapiens 

<A00> 21 

Met Aan. Lys Prr? 13.*- 7hr fro Ser Thr Tyr Val Arg Cyr, T^y Aptt v^l 
1 & 11] Li 
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15 



2D 



25 



30 



35 



40 



45 



SO 



55 



«0 



6* 



<^ly Lou lie 

Lys Lye l^eu 

Asn Girt P)io 
50 

Srsr Pro Thr 
6S 

Val Gly Asp 

Ala Eer L«u 

Pj;<j fi^r Lys 

Asp Lys Asp 
ISO 

Tyr Met. Pj-ij> 
145 

Sex Asp Thr 
Thr Aan Asn 

Gly Glu Ely 

135- 



AlA V^l 

His lie 

iter Glu 

Leu VAl 

B5 
l.n;^ Leu 
100 

Olu Ala 

Arg TJix- 

Pro Asp 

Arg Pht 
1G5 
Pll* A*p 

ieo 

G?y Pti* 



J.t?u Ser 
Ala lie 
Arg 

5b' 
Li-su Leu 
70 

Asp Leu 
fro A£p 
lie Thr 
l.i^ut Met 

150 

J3Ls S*r 
Glu Arg 
Ciy Val 



l.ys Lys 

L'he Glu 

PL*; J\*;p 

Lfcu T 1 o 

Ala Val 
IDA 
Val tiln 
3 *D 

Tiir P i<s 

Sor Pro 

Phe Ser 

Pro 11m 
165 

200 



lie Aip Pro Gin 

l>ro Stir Cly Asp 
45 

L©q Leu Gin 

$0 

Trp Gly Thr Thr 
75 

Gin Asn Glu PYtv. 

Pro Thr Ala 

Gl/j T-y^ Cln Met 
125 

Val Cln Asn Leu 
HO 

Glu A3 Ly.-J fScr 
15!> 

PJ}* Tyr- 01 U Leu 
170 

S«r V^l <^ly Gly 

T.ys Cly l^au Arg 

20b 



Glu Cly Trp 

AAp Arg 'J'yr 

Thr Gly Ly* 

Asn Cys TJir 
30 

Phe Ala Pi:n 
95 

Asij T3ir 3,^u 
"110 

Pro Phc Cys 

OJw Gin ser 

Lexi Glu VmI 
160 

Lys A«n Val 
175 

Asn l.y^ fret 
190 



<210> 22 
<211> 1260 
<112> DNA 
<213> Homo 



^00> 2^ 



cttct cogca 

SCTtCttaagy 

aagtctctgg 
at gatggatc 
gAL.t.A k l-.^tg 
gaaggggaga 
r=ctcnf?ott &g 
attctta^gt 
ggatgtaatc 

agtggtgtg* 
aatgatagtg 

tccatgttga 
qgcmttgaga 

accctgtggg 
anatttaaaa 

cttggtttaa 



cgactgttac 
»«ji3aotcat 
tagtgggtgt 

c:k.q?tqCI0LCt 

tgcagcatgg 
tgaccgccaa 

gt.C:4<]i:ti?P? 

tcasgtacag 
"t.Qttac^tg 
tggsttctg*: 
■crccsttggstg 

A £ ijt qq^rtqg 

aaggatatac 
Ana p t-ctat c 
atgaagtct t 
oiccagaagct 
af:ALi;r_!?iJ5 3 
actacaatgt 
tttgatcttc 
fci+^ttgcnnt 



agftgqtctcc 
tgcaccagtl. 
tgc3rt<-.rt.T>qtt 
tgctcttgtg 
wttattt 
ttctaaga 1. 1, 
tctggtgcag 
tcrL-l.^.-j ktn;o 
gaaactaagL 
I*, n ij a 1 1 1 og c 
ga 1 1 1 1 gg g ij 
tgtttctctc 
cjqs-nqtqcat 

caggattcat 

aaaggatgaL 
riqacctaaaa 
gattaar:l:c3C{ 
ttcaatatgt 
ctgagctctt 



<210> 23 

<211> 3151 

<r212> DhJA 

<213> Jdojno spa lens 



<40O; 
teiacacagtt 
tatg^A^^»M 
cctccatcag 

agcatcccca 
gact ttgact 
g a I: a v.tzA ; bt.q 



gt-gaaaagag 
cqDtaatgga 
gg t gyy.'ijiy.** 
gcaacgtgtt 

t3C9C:CC0LCCg 

cggccaccaa 
csgccgaoga 



atggaLgtgg 
ctccactaca 

tgagtca^gc 
catccacgtg 
ctcctgcasg 

gagcttctcc 



agagccttct 

qgtatggcgt 
g^L^M kt.gr? 
ct tea ga cac- 
ti h .^q 1 1 n t n n 
agaaaLgttH 
ntcataattg 
y ^."J I. Lrt^rn 
taccttatgg 
■jwrK^t-qqcci 
caggaal.liy/i. 
aagatggtgg 
rtttarrnttaa 
cccgtgl.irMM 
ccatgtatcc 
cjaggLtgr-tC: 
gacctgtgac 
a^ccttt ngt 
gaal.LL<isjs?i; 
gaacaaat^d 



gLL'.i:r:flrrtcc 
gcgactgctij 
taogatgaga 

gagatgaagt 
gaga cc t tea 

caggLggaci: 



CtC ti_-E:l.cjt.q 

aqqcaacagt 
gtge^-.d hci.i q 
ctaqataagct 
C Liaia m t rrt- 
ctgcaggagt 
Mkgt.cttonEi 
tggtttccaa 
nncaccgr.gt 

actcaagtgt 

ttgaarfLjl.yr: 
gtgtggctga 
crt.^ tq q t □ a a 
tcaatg tCrLry: 

o^ctcaagaa 

tagllgrtqr-trs 

tttcatccat 
tr^^cag^atc 
aa1. L«rt-cb<> t 



tagccctgcc 

acatgaacac 
r^qota&ggac 

acctctat1.il 
agaatocatg 
1:L*yykqo^C<| 



c^naaatggca 

r^ttctggga 
taaag^a^A.^ 
qqc^rgataaa 

attcattatt 
c:<:oc»qtggac 
gattggaagl. 
trj^cattcat 
ggi-l:gi.^<.y.j 
gggaactgac 
■;tntgaagto 
tctta l. \.<jm& 
qgggatgtat 

aagtgcagal. 
t.aggotgtag 
gtacatyy^ k 
aaagcctatg 
tgtagtgtga 



tgtgtgcact 
got ggtgcat 

caagtttatc 
tyoctgeage 

ggtgaaggtg 
cgtcatgaaa 



60 
l^n 
1&0 
240 
300 
3C0 
420 
ABO 
b40 
GOO 

720 
780 

1020 
1090 
ll^O 
1200 
1260 



60 
120 
1 15 □ 
240 
300 

420 
4B0 
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ate»«Mccci agytqcqgag ctl:<:qqacct gLyteecgc* qcqgcU:r:ta 
onggactatrj gcggctge*t q t ccct fc« fcc gccgtynqtg t <r: I : t c t a erg 



10 



15 



20 



2S 



1\) 



35 



4ft 



45 



50 



ccfi^ d tester ag^aeggege catpttccag y.i.^ccct^t 



Ctg 

cggggy<:tqa 




CC I: yy^ct t c 
eaagtgccor: 
g^yiMioatcg 

yKqcaaagca 
gacttttr.^g 
^nettctgaa 
ccccrctyq^c 
<:#atgagace 
ccl.oqtctac 
cggggaort*it 
i:.iqtgacctg 
-Milage 

gtggdtr.-j^ca t tdi:nctgtc gl.gqtcccag 
tntgagctgc: *gtac:taC:y.? gaagca^uq 
^:rrwcca ^<-c*^qqtcac nqtgcagggc 
tgtL-l.l.r*r^aq gtgcggqi^c gca t:< :qt qgc agtjiVl.noqgq 
cgcta^qcq gcaagat 9 l:» cttccagaee *fcqacagaag W q at t a e<: ri qacaagcaLc 
^aggagaagL tqccactcat L. rqq ct cc tegguoqetg g c t r^t ct t c<:l-.^ctgct 



qccaa^Hog gggatgagqo r?t gt a cccac tgtccc*tc?a a tvig^cqqa c 
gyyqccacca atrtgtgtctg c.Mjcantggc Lantacagag cngacctg y .> 
atgcor^tgca faacKwdcco ctcc^cqccc caggr;l:qtga tl.fcr^agtat 
tccctcdLcjc tggagtgga.: occtccccg.: qactccggaq gccgay* IT q 0 
a^catcatrt qr-nagagcLg tgqrjtcgggc ^gqqqtgcrr. qcacscgcui 
gtaiLiagtacg CAi^acgcca g 1 1 a y . jeet g acc^yccac g<:«*-ttacat 




ccagaiirjicigc ccaatggnyt 
r?t cag tgacj I. # a cgeca c 



rjatcctgga<: 
ayocataaaa 



atuhcotgtg tcaa**htqa gcaggtgaL^: qqagcagggg nirtttggtg* qqtctgcagL 

ggccac^L.yn agetgecagg .^gngagag atel.htgtgg cirwtcoagac g^t.c^qtcci 

gqetacaegg aq^agcagcg ccg^Histtc ctgaget^*g cctcwh^t 
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tcitggaaaLg 
g^baataatg 
gate n-h recite 
cccaaoaact 
gal:gtqnctt 
ga gga t c £j t b 
q*:acagagct 
aaagAl-.q^ng 
gaagaagaiH 
ULqcaaaaat 
accbgattga 
tcacctaacy 
tb^aatttgt 
atggr^.ttgc 
ctgatcag^t: 
csuscaactta 
gatgAtiintg 
gtgggagggy 
Lcrpqcgagc 
eg a gt gg& i\tz 
at cccttttg 
Lil;acttttt 
ttg£jMh< 2 ctg 
gaacgaag^j^ 
wqo»ct casta g 
gcta1:£c3c*an 
ggagttgaLg 
tccagata 
tcttct.tttq 
tacaataacL 



cttgLaaaHi^ 
cagctgeaaa 
Mrtcj^ctgcinc 
cagc^gcL-Ml: 
cctccaagaa 

rirttgr.itcrtqa 

acttaacaga 
ccoctttaat 
tCCyy^.i^Ol. 
^caaagatga 

t agge^L: l:y h 
tctctaatga 
t^aatgtga 
teatt^tegg 
cattsttttg 

tt act tat Ad 
csattgttgc 

tgacacttca 
cccit gijnaac 
aagfjg I: I: \.» K 
tcaaagtaga 
tca^aaagaa 

ttsgacgtga 
ate-ctgeaga 
«y<iqaqqtrjt 
ttggLdtgl: I: 
aaactgaggg 

I: Kc? c t qq d 



/j L<:-D^qjiqnci 
gcatctaatd 
gaatgagtt t 

aggagcaagc 
qntaaaaatg 
agaga^g^Lri 
ccgtgttatt 

I.^m rjr:,T wr.i/if j 
agatgaaaag 
t tcctcaagg 

UgrthqKqbiii-. 

aaaaattgaa 
ettgaegtat 

iiA l.gg*agttAl. 

caaagegatg 
ntneotat-gcot 
dgr^ataag^: 
tgcttcgaag 
anntencaot 
gg a a c-t 1 1 1 g 
tgnnct tqgt 

^dcagagaac 
t ttgggatla 
ctacagcit t n 
ccdt^tcaL^ 
cttgaagaag 

tcagtttact 
tqtacattt-t- 



ctctcagtctg 
gdainjct not 
t g t g ct t c a. 
qagctttata 

t^fig<;ttac^J 
aacaagaci^g 

ggaagagLh h 
occaaggaag 
ga ^rtfj.^gct g 
ataggtga l.n 
qtggatattg 
acLgi>r:tttc: 
cacaatgtac 
«q*?aat&tr.c 

g^ccagattc 
».>h.gc*attta 
t get l. g« « ^ 
c?aagaagatg 

gt t a a a a cce 
r^tqantgagg 

aad l.1:rjitr;.tt 

tattatg^L.-M 
gttcaaggac 
Sw*-nntqcac 
cva g 1 1 gt. h q 
ttttttcsgc 
gnHfcf-tncna 
Dtgatcggcn 
ctMtggttg 



accac^agtl. 
qtccaacttt 
dy I. ,i t cfC 
ttgagaac-Li: 
gnqr-tngfiat 
ttgaatld i.y 
t ttctagtgc 

a^Cl.ganp ti" 
catcjttctgc 
crdrcrjcirjggq 
atgc<r:Al. I.mj 
teaatgeact 

rfiitrjito^Oga 

acagtctrt.yH 
cccittgcagc 

ggjaqndtcrn: 
acagtcgad^ 

tggtttacta 

aLg^^gnqc^ 

gaagaaacaa 
tyyrtl-.hgt.cq 
ttctagaaat 
ttatgacatg 
rtt.^q^^ttcg 
agcagl. I.q*.i 
ttgtccggct 
tggaatttga 

tgy l-Lytqrja 

aattgttttg 
tnijtactggg 
tCTtdC^g.q^o 



agetagcega 

tettegtatg 
aaanctctgt 
yrt.^ n 13 cj h.g i it: 
tgattttaaa 

tgaggcat tg 
rctt cnaqc-n 
tgi^tgctal.g 
agacaacaat 

tgtatatttg 
tcctaattat 

ccaaggaaaa 
ggagncest tt 
1. l: 1: Cj : iLO^i i-i I. 

tgcaaatgta 

MtCDOt-qjiq 

gaaaggtcct 

gga taaagat 
tccct'tats 
catgtnc^gc 
t.CCit»l*.M tt-Kq 
Agd gat gate 
aggagaacaa 
gq#3rttrttt<: 
gtttaatcca 
tctg^ctat t 
nt-l.a jLcji^gg 
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aaaaaaggaa 
gatt battgg 
cagacag^tc 
ncaattacaa 
aac* , bcagtsg 

aatctagA<:t 
tct gt t ctg a 
ctcL(:r?ttgc 
nagatgatta 
tl.tct«tgtgc 
aa ggc^«bca 



ctt ttcgtyy 

a.gl-e*tgaagg 

tttttgytaa 

stgaaaacay 

aa&tiauagtt 

agtacL:tntt 

tccaag^^uC 

btfl^ggagtt 

agttl.a<-aoic 

b-aqocaaaaa 

ttUL^cttcc 



cttgg^mjru; 
y«c*tgtggaa 

qaaggaal.kb 
c^iggctttt 
caga^cagaa 
□ gaag^rt.^ot 
'"tgggaaal:^ 
gggcwcngac 
tqgcccagwr: 
ggArttactca 
aggatl. bgqc 



tc t c a l i : i?o g 
rjittgacatyn 
ta Ly.vtctaa 
g t ca a t <t Ivt t 
^gqngaggt L 
gaawbbgaat 
a ca ga* l: /*t <i 
y ttcattcal. 
agaguycctg 
*cagaaagyt 
ag^w^agaaa 
atgct^l:.^^ 



tt_<:bat a tea 
tgatca-i^-.tt 
wgqaaaatgg 
att^tq^ota 
ttcatdtgc?t 
l-cjct tatatg 
acgyUjqcta 
btacagatg.^ 
tggy ?i gga c t 
tacctacd hfi 
a a c bt aaa g a 



gagt ttaaaw 
C£d qott at ca 
t gataanntt 
cbt ctcaat 
gaccaahqaa 
^nc?aagcc$<j 
ta^riagggac 
acs g a a m ^ qo, 
Agg<aaaatta 

tcata*;l*.|-,qc: 

QMriattgtt^ 



<210> 29 
<2U> A261 

<213> J]amo gapiei^js 



<OI]Cl 
rf^CDgttCCC 
ttacjL^tqttg 
gaggacttgg 
gcL-*<Ji;fccga 
tt ccaagggg 

gtaggagLcy 

uncatgtcat 

gc^g^cacta 

tgagcataLy 

*qattggaga 

attatghnat 

ggagagaagt 

i>atttgaaga 

ttaaaci^rtrtK 

bfcqtgcagat 

taaagtnwcc* 

tt gctcgagt 

actfctjfccacc 

tgggctgtgt 

aaaacctggt 

a t b rstccg 

Lcaactccab 

aqcttattgc 

caqq baanog 

caaaqccLyr: 

tucttgsgaa 

tgAAttctgg 

aatttattga 

bgaoigoggca 

ggaatgtfctt 

qggctgtctc 

1 1 g ft rjiuci n n t 

atggtcgat-.y 

tggaagaatt 

atcjbqgttta 

agattaa «<jc 

caactgaaga 

c^uigtgctgc 

aaa ag^n 3 r;t 

ctttggaatt 

otq±.t?acttc 

aagaa&ty yo. 

atgaaaatc-t 

agAr.^qttgg 

gcaagaagUj 

catccttctc 

gi>tctccaag 



> 29 
crhrjtcctcag 
cctgtc^iwgt 
gcgncaggta 

CCtg t L) » y r|t 
bcaggaacc-g 
tgacccgocq 
cagcctgggr: 
ttiiuitaaatg 
gaeegg Lt^it- 
tatgaaaatg 

aggLbcpttt 
caaggaafctt 
l:c?ctgtattg 
aaa licjtj^t ot 
naatgetea^ 
atyttt^gct 
qaacatat LI. 
L^btnntagt 
tgsaatctgt 
cctttatgag 
aclujaagata 
cagctLgrhq 
nttggagaaa 
agayqadlttt 
aecag^A toa 
tgctaaatae- 

agaagaapyq 
rjaaagaaaag 
agayaagcag 
aagggctggl. 
Li>i?ntc?iccg 
geagegg^fc^ 
c?ctcccagaa 
c:otacagcgt 
tttgg^aqn 
caaacttcgl: 
ywctgacatg 
tgtacl.Af»o>a 
atgggaagaa 
Kcttrjaaaca 
agctl.hgaaa 
aaagagtaa^ 
krtsprjotcaa 
tcacaaagnt- 
^gagatggga 
t^Oaaccgaa 
aaaagl.otgg 



caytngctct 
tgccrcrtLinr-c 
g<:ctc^cagt 
tggggg<;yyt 
^hgtgtttgt 

gtggtArtgtc 
t qtqagegge 
CStCybqqtg 
tatggtcg^t; 
<:<=ttaggagg 
ggaaaagcbij 
aocatctcacs 
gcra^^c-atga 
ctctaeat.-iq 
s^nqgegetc 
Ct,ga£iyi:ntrr 

^taaccaaag 

act^ I: rt^^>q & 

gaaaacaagc 
tl.ybqtacac 
atcti-tiytjcit 
c ctcagtta i: 
ggt:btrtntag 
tgtctaA^yn 
ggacaaggLg 
Ljyatjtgcct t 
ccaaa ^c:ei tr< 
Eiagaaaatgr. 
auyunaangg 
cggtCggayM 
g^r-j^geggtg 

tgttCL^<:Kc: 

a^agcagaag 

A^yyciaaaag 

aaa c*g ^y .^^iq 

ctgaggcaaa 

^ytqnqaata 

aggctcdd^n 

gaacagc-tgg 

o«bbtqqtgg 

ggtggLt^L.i; 

gaagtgggcc 

A^tg^tatt t 

gaggatga^.i 

qaga^agagt 

gglcag^btg 

aaa cat ad: 

gggaaaaacc 



atggtttcag 
ccgga a t c l:h 
c<i<^r^cqct 
ncccagcdisi: 
tttyt.^cgcg 
t gacgcgl.<:rt 
tgnci^qgtag 
ct t cL g i : i : 1 1- 
octgttattt 
gad<:<-^tgqa 
ttcttgttaa 
gtintrjtctga 
agca t cczn n a 
tsatggatta 
tgl.bl.tnnga 
tacatgatay 

ALQg^>cagt 
tggctcga/ai: 
i: i: t « tao caa 
ttaaaM l.rjc 
cct 1 1 cot tt: 

ct aaacgnn t 
cactttcaaa 

l.iirtqttCttt 

taata I./jI:^^ 
aacaggccca 
otgjigqnngc 
atcagalidaij 
yqoitaaatag 
a a g t oq c- 
craggecagta 
at^/j h^aagc 
gacacrttAyc-: 
etatgeagaa 
Lrtrtqactnca 
aagaagr^LijM 

age$Artrt.^Qg 
cgagggta^j 
^ntcaaagca 
tgg^l-.cjcjoag 
caagtaagcg 
p^Taaaagca 
atgagai^Mij^ 
tgattcctct 
bggqngnggt 
C: W A CH y m t r.-: 



ggcgyi^ncg 
tagtcactgl: 

gcgggi.c-cg c 

tccccctgtig 
tagitti.iqQgc 
qccagagacc 
a*T: jq^r^t tgc 
aa-gtccLaj.u 
i.yctctcaggt 
gaagl.«b<?tq 
»tcgacagag 
taa^y/incigg 
tattgtcxrdrt 
^i-.qtqaagga 
agacc^g»bc 
M.^aciattctt 
gcagclihgqa 
ttqeatagge 
t a a a a g t c^ac 
atttgaagcrt. 
*<jbqt ^tcca 

tCCLAcjqq^t 

cgaaaagttL 
g I. btc?gocca 

t g tCL^rrl.cjot 
gaagCatggd 
l<:«3at tccc 
agc3££HMpu 
nttcctgaa-g 

gguc:«qqqaa 

ttcctttttl. 

tyanr^ttac 

tgtagagaga 
r.»rt.^qcocga 
aaatttt^nt 
trj^taccaaa 
act La.i y gcq 
□ aaggaagcl: 
M;iiji?tcagat 
gcaqq l.^^q 
tttaactga I. 
»qnant cctg 
gcatcdi^ \:r^ 
^Q^tgccatt 
cgaLgi;aqtg 
tattaaa^tu 
tgtgctgaa^ 



tg^yi^gt cc 
qacgaggegy 
gcTLiLgqccaa 
cngccgccgl. 
gggl.nqcr^ga 
cjqtgcccgg I. 
ttL<:br%ttnc 
ggaagacdi^. 
Curjirjtgtcat 

agacligt:.^.^^ 

g^bqgeagae 

caaga m1:t^oci 

tataaagaat 

ggl:<j^htt gt 

ttggactggl. 

Cririrsqaqaca 

gattttgga* 

A)<:br?Datact 

atttgggt^i: I. 

c^aaacatga 

ca LLrt^bcot 

a ga cca t c ay 

iibctcccctic 

ca^LU-U^t-cc 

cagaaaatcd 

qcvt.^naaagt 

gt$A,-jyynnn 

agaaggttgg 

yi>tr?aqcnga 

caaggdL^gn 

gqc-Dttggag 

catge^.^ttt 

^ggggaatct 

gtci.^^ccinng 

gccgd££yai: 
qngcgccaacr 
gg^r:^q^ciq 
oaaacaaatig 
hAhrrnaagag 
gt t ccit.<: t <jo. 
cctg t cat 1 1 
««ccnr*gaag 
cgtaggcL^^ 
ggttcttgtg 
ttc:l.<:t.g^ to 
acactggat^ 
gattctaatg 
M^ynttggng 
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aa^tqaatt acagct*Uiy ncagaar:fcc<c tagaaaa^c atctt.ttoaa agLgH.jqttt 2&40 

atgctgaag* ggagaactac anacccttac ktactgaayfl aqagaat^tq cagtg c a I 1 1 3000 

r;«naagaaat M«t«atca gctaotqttg att<:t*ct ga aa<,qaaaagt Kcociaqttta SOftQ 

ctgaggtqtc tccscaaabcj tcagaaggu* atgtggaaga acctgal.q^t ttggaaarnq 3120 

aagttctac* nqagccaagt a^acacaca cayntgggag tttqccaccr. qttcttaatg .lipo 

otgtgtggsc L a y » q*ga a g gaag^qcta agganwi-tga gttyqoagflt aaggfr l-^ctq 3240 

tgy.^cngag tgaagLkh.gt gaagatay** ttccaggg*a cqtggacL:« a tcctgtBaqq 3300 

atcagayayt, tcctgcagta ga.^ynttctc cc.c»qtctgg i^LqtgatgLa g*goaqtcag 3lifttt 

tacagccaga Atoqattttc cagaanqr.gg ttca1:fc<:taa gga f: I. tquac 1 1 a gl: I: c;a qq 3<f20 
iflyttcattg ctcsctdy^i gaaccaaLUn cnnttcgaL*: tcactctgal: tctccaccaa 

aaactaag rt q caagaatccc t L^rtgattg gact.ttcaac tgy t.rMr.qttt gaLcjuiaaca 354D 

atccaaagat ^otqciggacc tgcttatttc cagatctlitc caagrtgtto agaacccta« 3fiOQ 

t.ggcacgttcc cactgLggqij gncgttcaU: **qacagtct tqaa a teg at y.iqctggaag 36f>0 

atgaac:i: fl ot taaagaaggg i:i:ttctgatt ci:y*/*qacac tgL.^httgaa gaaa<:i.qaca 3720 

cagatttacra »qogcttcag gccLewntqq agcagcLy<it tagggag<:":*ii ccaggtgaiiLt 17G0 

aat^agtga ggaggaay*g tctgttttaa wanrjcagcga t.yfcqqagcag a i: » yc*ag a g 361 U 
ggacagatgn wcagacgag gagga^acc ccagcagc:qa aagccc-cl.gp acgaggaaLg 

q^actcagat aataghqncq ctgagacca^ haytgaatgt g^tntgaca g Lg t r;=t 1 1 aa 3960 

ccahttAgag gaactaagar 1. 1 rjsact t gga gca&yanntg ggcLtL-.ysnn agttcttLy« 4020 

ggtttatg^g ^nngtaaagg ctatr.^ahqa ggatgaayat qaaaatattg eciatttgttc *0ao 

aacaatagtt gagbal:*ttt tqggcaatga ^rtccagoat ctchntqcca agatt cfcyr.* 4140 

tttagLi^trj qcagatggag cct.ntcaqga agataal_g*t qaataatc<:l. cngqacattc a^Utl 

tttaatagtc- aA<:tqtaaga acacattCya acit.tggctca t-H*tocaagc ttc:L-Lggqan 4 260 

ate d2e:j 

<210> 30 
<*11> 17S£ 
<212> D1*A 

<213> homo tapirns 

<A00> 30 

tcgggcgiiaij ncg&gaagat gccgtHzgyna ttgaegcaga gn^yngtgea gaagal.chqq GO 

gtqcccgtgg a cca<T: a y yoc ctcgttgccr agal.untqtg ggc c-aaagr.l: qac&aactcz 120 

cccAunqtca tegtcaeggt gggartbceec gcccggggiM .igacDtacat cl.c*:uar?wq 1E0 
ctgactcgcil. scsctcaactg gattggcgU: i;ccnMaaag tgLl.c^&gt cggggagtaL 

■sqeegggagg ctgtg*rtyr-p qtacagctcc tacaaubtct tccgcccc^w r^uitgnggaa 300 

gccALqa^ig teeggaagra atg Lyt:c:t,tc* getgecttga y.^qotgtcaa aagctacriil.y ^f,0 

gcgaaagaag ggyij^cffiant tgcggttttc gALcjvcacca atacLa^l^y ?ign gag gaga ^20 

i:Ar;ntqatcc ttcatttLg^ i:n?5qnnnat gactttaa^g cqtttttcat cga$l.c:ijqtq 490 

tgcrgaoq^^o ctacagttgt ggectc^f^at atcatggaag L t. » m v citct c cagcccggat hJQ 

tacaaagacl w.ao ctegge agaagecatg gauif.i^ttca tgaagaggwl: caqttgctat 600 

gAjaqccagpt accacjctttL ^ccccqnc aaatgcgae» qqqnottgtc get ga Lciaja y fifin 

gtgattga<;y tqqgociggag gttcctggtg Mm:r^q n gtgc aggaccai:Mt. ccaqngccgs 7^0 
at^gCgtact acdigaLijflfl ^:c»tccacgtg cagccgcgl.a ccacctacct gtgctijy<:.^: 

ggcg£iy M «<7q agcacaacct ccagggc^": rjtcqggggcg act^^qqect gtccagccgg &d0 

ggcaagaagt Lt.yr-c^qtgo tctgagcaag ttcgtqq^qg ogcagaaccL y »n^<iziootg S00 

^4 r -gtgtgga ccagecagM. y « y i|<igcacc atccagacg^ i;r-qciggcgct gcggctgL"ii^: ^fiO 

tacrCjAgcngt ggaaggeget caaLg^yahi: qacgcgggcg tc-ly hijiiqg^ <ictgacctac 1020 

gag gaga tf: a cjqqacaccta ccctgaggag tal.gcq^tgc gggagcagga ^nqt.octnt 1000 

t«wgctacc ccat t y ^yy.i q t c a con g gacctggtcc ^qcqottgga g-ccagtgatc 1Dl4U 

atggsqcbgg ageggcagga gaatgtgnly qtcntctgcc accaygnnqt- cctgcgctgc 1200 

ctgcctgc^t. wr.ittcctgga taagagtgcra gagy^yotqc cctacctgaa a I. g i : i irtct t 1260 

r^acaccgtcc tgaAA^:t.g*o q^ctgtcqct tatggcrtgc^ qtqtggaatc catctacctg 1320 

aaii^tgqagt ccgtctgcac aca^i;gg$aq nqqtoagagg atgcAA«qn;i gggacctaac 13B0 
ccgctcatgA q^cqcaatag tgtcaccccg cta^r;^«qi:r; ccgaacccat craa^aay lflfllJ 

cgt^tcaaca gctttgagga gt^Al.qhyqw tccacctcgg tL*:gc:<:ct.ijcc cagctgcctg "1500 

cccccggayc? tqcccacgca gctgcctgga taAArt^tqa aaggctcccg yjiycrayiiq^t 15G0 

goc^tcctcca ggaawE^cty oqqcaqacgt gtcggttcca t-trjir-citttcc atttctgeag lfc20 

cttaguttgt gtcctgcccrt ccrgcetga^g ^.Tactcgtat cctgagga^h tettceggag 1C80 

agggtggyyt ggagcagegg gggagccttg gr:r:(j«*goiga accatgctr.g quwrw^totg 1740 

t gt cccct r: y i^oog 1 7 &6 
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<21A> 
<212> DN/s 

<213^ Homo sapie^js 
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gstttataat 
tn<3tgctgtt 
tgdwittrrtt 
cagcctu Li.r: 
ct ccaagctg 
1-btqrrtccag 
cccgaa<;bgo 
gaagataagt 
cn^y^ttccgt 
ggtcctt^yq 
tgtctctctg 
Aysqqtt cac 
atcctggynl: 
g^gggtgcac 
C t. »cjtq 1 1 
gacttctgag 
rjgagctgcaa 
a1: L c t n y g b q 
attttaagat 

agtgtgaaaL 
ycoccttgag 
a crat cfcvz I: a 
caatatcatg 

gLy l:«<:ntta 
gtgttataett 



>tt.ctaaagg 
ggca l:yqcr-t 
gatgtcatcg 
cttagaacac 

cgtaacgtga 
nagttgctta 
qq l:tt toccci 
tacctgatgg 
qnncatggag 
ai*ij?ictcbgc 
aagcaggtgg 
ottggactct 

ccttgcattt 

ttt taady L<: 

gaggttgtgc 
ggtitibqqgaii 
ctatatcctg 
wyttctqccn 
ccaggtggat 
tccnacat tt 

tC L L y h I. 

ccatataatg 



attccggaLo 
Hfccjqttccaa 

gtgccatcag 
anqncaaatt 
-caaMijtitt qt 
cggctgggyi: 
*catatttaa 
ttgh btcaaa 
aaaaccgtgi: 
gg^aciaggct 

attL^AQbqt. 

acccagatt.t 
L-tijngngtgc 

CtgCdy wy n 

ccatgattaa 
tggg«^qnn 
gtttgaagaa 
i: bot qa t gtc 
atgl_Lgb<-<:h 
a a a cat case 
a LLJc:t_gnn fcq 
octet gaege 
q 1. 1: t a r icirjt-*"? 
tttcccagtg 
natt ctt cca 

taaadrty?ati: 

oMaataaac 



<2:>0> 32 

<211> <3isu 

<£12> UNA 

<2i:s> Kcxmo sapiens 



<400 
ggeggcttet 
qi-t<jctrjcta 

cgacg a gr.f: l. 
agacatggcg 
Lf : i;w q H qcac 
tgaaagcal.* 
aggaagDCCt 
caatgc-icsiac 
tcgcgatggg 
trjatgtaact 
taaaaort^arr 
caaggaag a a 
gntqtggggg 
Dtcaggc£ot: 
ccaqtttgtg 
taatL^flt^c 
tgactgtgtL 
teccctccaa 
<^ a y « < a an qao 
tttcaLtahq 
agggtgtgta 
£Abt;ct nagt 
ggtgtct.yl.^ 
aggaaagagt 
tjqsbttaagt 
ttgttt.r>ton 
atgatgtgct 
agttrtqanca 
tcaattytg* 
aa^igaatggt 

ytic^yrt^gctg 



> 32 

wggtqqqr?qo 

ctgctaccac 

cgtel.h^ivyy 

aa g a gacctt 
btqnaaatga 
ac L y ft y Art<j y 

tatgaegtec 

Agiibflcrjc^t 
ctggagggf.-^ 
caagacaagg 
beqayctg 
gacaggtgga 
t^cacaoegg 
gtcaaagaa L 
tggacactga 
gntt-tttccc 
gcacchgntg 
tttaaaactc 
btaaaggtaa 
a a Lcc:amhh K 
agttttgaat 
iKTgttttatg 

te£ l:yc:hqq l.y 

agggaatcag 
ncatttttct 

a C. I:y I*. bg n t. y 
tttatataca 
bfcttaabnna 
acaaagtgaa 
ctgttsacaa 



qcaaggccgt 
cgccLgy^h<: 
cccggaagaa 

c-a gca en? I: g I. 
cgaaaaaagg 
Myni^drraqqa 

agaggttcat 
qggngntenn 

gcaaaaa&aa 
ggaataaaag 
CCjLgcja^rty i: 
atgacattct 

caaattcatg 
atttctcoLCt t 

ttgacttaat 
aatatgcttt 

ttctttgact 
tcsgtctgtaa 

t. r:ijflL>y:L£LJ! I: I. 

agtcctttgc 
ttgataaagt 
tr^tyfl ht.tt.cj 
gatgtcattt 
ttaaaaqtotg 
a ag r.gt a tec: 
tttgtgcccc 



bi;nttqccac 
gt c c B d L t rj 
cccccagaat 
l:*jtcttaatg 
gaagggA^g 
ctctggcaaA 
n^qtc^ngcaa 
attcai.i^tt 
tccagtggat 

l:^t:bqqangt 
gggagttcai: 
tjcctgtatgg 

ttatgaggl.y 
tttrjc?cagag 
gggLe l:k \:n<: 
C ggaatctca 

atatcattti: 
ttttatctga 
tcctg^agLrl. 
qtat taatct 

3 tiC^Ci l.g< 

tgttatataa 
agL<:/4c.i tco 
aaggatcttd 
t-nr^tacaaa 
cttgaacdy I; 



ggtcctgett 

gaagaagg^L 
gaaagatgat 
tii^n c. K toon 
gaagactct^: 
□nttacgagc 
Lgl.yy y fs Htr.i 
ggactttttg 
u-q ■] oc\ a l1 q g a 
gaagyaayyd 
agaagacctg 

agtgtcetct 

K.xgqct-tMn 

tttacLatrtrt 
ggtgagtcat 

tgagtacs^cL 
gtgaactcag 

tgaatctt^e 
agttaggggt 
a t .>tqt-t.cco 

gectaaatet 
^^qtbc^tqt 
ctaagaacLa 
cacattacat 

cacatccccc 



gcgi:r.c:cima 
gcaactctaa 
« "i r^oit t a oa g 

aagg«<:t.tqg 
atgatggatc 
g»ct ntnatg 
gagggHya.^.-> 
cctaatgttg 
d l.^Kt.qacct 
ggttgcaa(.Li 
nr;nttaagct 
agt gLj^rtK<]^ 
aaagataagg 
wt.ccinqctca 
agtaCdd l.o^ 
qqoataaagg 

ga^C L L. y t. rf .t 
acact ttggg 
rf h t j b t n q g 
tctgtgaLLri 
^ qo o at aag a 
tgtgta^L::L. 
p t qotc?tacg 

ttttC:Lggc:L 

egggotccag 

I. L. L I. y i. y ,*j ,*i o 

caa tg ca a cc 



t y ty c:c:Lcl.g 

gonggctege 

d L L C y c:y« b K 

gacattgaag 
a*jqu^ ngacc 
aL.gaLy V. I. I.cj 
ctctggcagg 

q^<?<:qtqcitn 

gtcggncaag 
ggaggaagca 

tgatggggca 
qctcctgggg 
agaaagggLL 
qtnagaagac 
rj ^ I. iz I. c:<: I: |-./? 
caactctact 
attogntagc 

ClLLrf l.yi:<:A 

atgtctgtag 
*itnqciociata 
LcatLdy t.aa. 
tetgetttgt 

a q a c. y a Una 
gtgctttcgt 
t ayaqecto^ 
aaattttttg 
qgt tagaaaa 
gt^yytyrjh* 
tccrctgi^cca 



^qgatcagct 

tLgl. bqqqgt 

cagatgdatit. 
I: rjicaa cat gg 

Caai:tqrj:a.aa 

qccgtcttAA 
l.^^atacag 
acgL^rittq 
t-qgat tcagc 

gtirtkgqqtq 

atgttgctgy 
MAcacjt gg aa 
a a ggcl.a <-.w 
uqaatcttag 

atgalyt.<;t.t 
aggtgactct 
gg^Liicnriaa 
ctacaacayg 
•*.">qcagtaat 
atggtLLyt.u 
kcaactggtt 
tactgc:aLL:t. 
nottcactga 
tg1_a Ltt^hi: 
t atatcagta 



v cct q-ct q ct 
ccgggacgcLr 
peaatgatge 
a a y ^ga I-, i : t. 
caagcaagcrcr 
b^nctqtatc 
gcagctLLLL 
tettcatget 
a<:«gy ttjhy<: 
aagagaaaaa 
ctcggtcttc 

tgggagtggt 
Ll^<:<:m i : h t tjn 

agatgttttc 
[is t tjqa a at <? 
gagattctac 
□aattctgat 
l- ;s t /* b i^.i t 
ccaggaagcc- 
tgtcact t ga 
g h i] » &<z L. i : 1. 1. 

ttccttattt 
<vn gbt 1 1 a ao 
jjLgHc-.cL.P.a I. 

atagagtaac 
tgtgsat nag 
ggMaa^jaccc 
tgtcagtgtt 
q r -ttqbi:i:r;« 
t c i : d <zc a a a a 



CO 
1B0 

360 

4B0 
540 
bUU 

GG0 
720 
T S3 
940 
900 
9*0 

ioeo 

1200 
1 

1320 

i srh 

1440 

lSbo 

I6t>l 



60 
120 
LBO 
»40 
300 
3G0 
420 
4B0 

Mt} 
G60 

730 
940 
300 
9G0 

1080 
1140 
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^ n a agt t 

g^tctgttt 

tgca^tctca 

gccatgcitct 

aarjaggcttL 

gcctthtggag 

acga gt czi\ot 

*ctcagggca 

gctyl.tttga 

ctgtaccLho 

t.gtttcctgt 

aggta^tnca 

ttcggtggyrr 

tr?*otaggggc 

cct U:tctga 

ctgagel.^ag 

y^ggctggg^ 

gttLyyrrncg 

tgcactggtt, 

tgtjyoqgtgc 

gtgaggtttg 

^Ktttttgct 

Cttttiihttrt 

b^ttcnttttg 

CCi/i<:;]ntag 

aaaaccatwt 
LKr-ncgtaga 

Q!CaatCCL/aq 

£jfc<rbgcagat 
cccagfcnott 
c-tggcsaaca 
ct g h .^^t- c-c-o 
ggttgcagl-.q 
g I- 1 : t c:aaa3a 
acagg^tvtc 
trtacctatE: 
crc-Lyhtttta 
ataaattiiijq- 



ttacttatqt 
occcttaat I: 
gtt«tttgct 
ctecaggqtg 
tcttgttccL 
aagnfrn.actg 

rTtttgatggq 
tgay* taatt 

tgtttCCA<:^ 

y^agtgtgcc 
aggtgr.i^igg 
gaagacraaa h 
yrprsgtagggg 

g'CjCji^Ciagagg 

acc ctgl:y<jc 
cccagtgcag 
ccc Lg^jtqrja 
gcacaggaga 
It.ti^gtagtg 
aaccactgLt 
ggtrntattca 

qtqt gtattg 

at<2l_jaL-.rir?tt 

cttaattggL? 
ymTagggata 
aggcat^Aqoc- 
ttgacactga 
l:<2^tqncjcaa 
tggga^g L :czg 
tqqtgaagcc 
agcl.A<:LrcJn 
agccaagatc 

actctgtl.y<: 
qg-ctcaattg 
atg^wttt-qt 
atgatsteu. 



aaatgfcsott 

tggtagcLg*: 

tHyqntggga 

aggccL:ijtgt 

wt gcat C <: K 

tctgacagtg 

gtggtlycmc 

yttaDtgtcc 

gcLh.bMtta 

tnggagceb.t 

ciirsacttcac 

acggtytciga 

qtcagtaa l.y 

y^gqtagaat 

agg^qcsggag 

tagaagLgqc- 

gi:woggttg 

t c 1 1 a m +: t.qt 

^qtgntcaga 

cagg^iT^tng 

tcactgcai.w 

gt;yc:tqcagt 

qtaaattcci. 

ccaa^ttgcc 

gccagci* I. hq 

hjicc-ctgaag 

CCtaQqaqgc 

ntgtggaL^jn 
yjgrragccaa 
aagaaa i.ytt 
ciqgtgggtgg 
ctgl.irtttnc 
gtggctgagy 
Hiji^cGctgc 
aaattiLl. hfc^ 
c-cteactaga 

a L C t: t r^cac 

cttcettt LL 
l-.+.qea ct a t c 



atg^hqottgt: 
t caca 1 1: tec 
Orttt tgect^b. 
gatcl.«ocgt 
rraacctgyag 
*jggagagag<': 
caagni;c=tgt 
tqcatagii^o 
ag^^aaggt 
occatgl.^og 
ciiJiqtagttL 
tacaijc-otct 
totcaaagL:» L n 
I. tccctga r: 
aaa lipqagta 
agagggayiiq 
gwgqngcgta 
gatggg^tng 
« y aggac ecu 
ctgaagqtgt 

qtcCCtgtkq 

ggai:wttttc 
^qoagtacj^A 

CtC<:*:iiagtg 

ggcatcai:;at 
tc:«ccatact. 
ccaci:l.qctc 
qacttaaat:* 
c^r.^nqctga 
gggctggytn 
fltccqt tgag 
taaaa^htaq 
norygagaa Lg 
acli:r:?igcct 
agagaaa l.yt. 
gl.gcngtagg 
cacagti:tc-e 
gt-cttgttLg 



atgtttgtht 
c L i n a a ga a 
gcagLoc^cc- 
^ctagggtai: 
cgyijacrctgt 
g D CS C a L* i : r^t- 
^.I.qtgggctt 
taatgg^^ca 
agcaaagcLI: 



tggccLi:»r^.i 
<:cacaccctc 
cai^Kit.ccag 
taggcrctg* 

LgK^qgcccC 

ggtggcai-.Ha 
Kttctgttg^ 
gtatdTapqca 
tcactcrgct 



aacg*:iTcsnga te:l:gh<!tcnt 
ctgtgtgCi:+. gttatgtacl. 



g^l-^tcgtaa 

r^nggtt aay* 
ayqcnggeag 
gcagr^yynn 
y^acatggag 
gt^l.Lj/i.Ttagg 



a^r^triOLatga 
gtLagdrigh.^i 
asaccagagg 
a y q.^^a-ca gt 
ggccag^yyri 
♦ ttrrqtagga 
cag t i^tirt q< i 



^Tcaggtgty <>qcc-g t gage 
cci.n tea g cagggf.rj^tg 
atggscanth gagtgtttag 
a L«<?Gaaa t a 1 1. 1 c:gqta ca 
ttattagcagi: nnaeggaatt 
gl.htaqtcag ^LLMiir^cttg 
ttaccr^ifl. Kt aatagattgL 
<J'3anaggctg ^yi.^rprrtgt 
C3df:i:i:i;ar)c t goat gacj \.tz 
tttccagcci: cngccttcaa 
g<:bqagaaac ag L cr I. eje:^ c-n 
cagtgi^cihi:.^ cgcctgtaat 



y tcnggagtt 
ccg a y <. y 1. 1] 17 
<j<^t tgaaccc: 
gg^l.qrtcgqg 
catttgttLK 
ii« b oac^gct 



tg^gMr^r.ioqc. 

tggtgcgggc 

ggrjwgqcsnqii 

atgagactc-l. 
bqt ttt tgag 
cactgad.gLt: 



caaataycih.q qgagnaatgt 
s-tttaatatc tagLyw b<:t-.;j 



<2L0> 22 
^1.1 > 65$ 
<212> DHA 



catgcc^lMjc 
ctjactggttg 

tgtuaqgaag 

ctggaattttc 
giiyqcaatta 
gagcai^aanc 
ctaaacacan 
tgbttgcgsg 
aaggacjncct 
qtcatgtt^t. 
ga Aatccagg 
gggggcaqtg 
^Tsgcaacag^ 
^g^taaatgt 



• 33 

agcttaccMw 
cnagt aaaac 
acfaM/j.-j t-^qa 
agegaatcta 
Acflncntgaa 
taaatga l.t^t 
aagtggccaa 
n^^ccangt 

aoctggagacr 

ggaagttgo^: 
cotccctgag 
g.^qijrrtqggc 
ggatgagy<: 



wotugagtc 
aga Lgg^^>u? 
tgccttacag 

Ctryyg^aa^C 
gaccaagtt.l: 

ac1:^ar:i:n^c: 
cctcctgcag 
ngatctgcag 

ac-at£f:Ag.^)q 

caacatcgcc 

actgt te !:<:<: 
cctaataagt 



tgggrfrtatg 
atagtacay b 

LLwqngtatt 

aagatagL.bo 
» ctagaaacaa 
aag^qtctg 
gagctC4A»q 
m wen a get a a 
at^Mi^cgnga 

at-C35Li:jit.i; 

^lyqcctcgg 
ggccg<:»qc.n 
^tgacactgl. 
aiz^aqtgagg 



ctgga<5rtl-.»<3 
ntgaatgtga 
£a hfltttact 

ggatagay^ a 
b- 1 q n g a a gt g 

tgg-d rt 

ctggagcagga 
»«y«ggagga 
tecaggntje^ 
tqectgeega 
ecLngcjqcec 
ngaggggejrf 
ijpqngtgtgc 
acywnyqecg 



ctatgttcat 

aacaagccac; 
qqacncaqea 
tgataatcL^ 
gtgcactcag 

m a l.yarJti^rtq 

gagggtgctg 
getgegt gac 
ga4.-i^:ggc^y 
tgcctcttcg 

tgggaccttf: 
gccttcgtgg 



J920 

1 f?eo 

2100 

2220 

23^0 
S^OO 
2-160 
2520 
ZSBU 
26-10 
?70Q 

2620 

is ia n 

3000 
2060 
3120 
3160 

3300 
3360 

3430 
Jb40 
3600 

3720 
37451) 
2S^S0 
^900 

4020 
406Q 
4140 



60 
120 
l*EO 
240 
300 
260 
420 
■HSU 

:-iu 

GOO 
660 
720 
780 
H40 
85 f J 



<210> 34 

<211> 10711 

<^^2> DMA 

<2ll> Homo sapienft 
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65 



<400: 
gcgattgehy 
nqaagggtgc 

agaLLtaatt 

gcctcctggg 
^tggggtccg 
byqoiqacgca 
acgcag^q^ 
ttccagagcc 
rxrazt ctntgt 
ctggtg^riyu 
tgtatgatgt 
i-gvoqctgcc 
tctaca£gi:t 
cccgcagtga 
eayi:wtqag 
aaaagactgt 
h t.qgrjgt 1 1 g 



* 34 

qyqotgcagc 
gataagge<:g 
qqgtagggcg 
egefji:qr.*ijq-g 
a ggcg<rg cr: t 
gtttccttgt 
ategagectg 
taagccU*jqc> 
tggcgatgga 
ottqfcgtcatg 

ctg-cacgaaa 
■.?*c<iwitqng 
ccaggtcaga 
ccttattgga 

Ctg liettgcq;? 

gtttgtggat 
rt^cqciggqag 



getgeehoog 
qqctgaggtge 
ggtg C: 5 ga ct 
cccggcgaga 
cit-cacgcDct 
t.flwtqacaat 

g CC*t t_y rt-qqo 

gacctgattg 
t£iL*3tCTCTttC 
t ccg cc c- 1: ij /j 
q^caagtacc 
at£Ahi:cagc 
aactgegage 
ywtqtcatca 
gtcatglih^t 
t<rnctttata 

gattttattg 



<210> 35 
<211> 460 
<212> DHA 
<213> Komo 



sapiens 



<4D0 
Cttttttilicr 
cagacgccgc 
c cz hi : r?r?r7ccig 
aegacegget 

ccccaggc Le 
atgagaagaa 
Aw ?it tcct gc 



> 35 

egrgatgege 
oqecaaggae 
caacra A$gl:l: 
tggcaagctt 
Lg i- n ij i^t'cct 
ggaggagtcr. 
tccDctgcaa 



cogaggtgee 

atcaagaaga 

gccagtgtae 
qc t g c-t gg 1 1 

gaaqjaqtuuq 

ataaagcett 



<210> 36 
<2I1> 441G 
<2L12?« DHA 

<213> Homo sapiens 



<400 
gctcg r^acagg 
atattttatt 
tttcatgaaa 
ta 1 1 c c t a 12 r.i 
ccagaatgga 
^y^tqetttg 
ttcraAgtuyt 
gcaggtcatc 
a^ttactaaa 
gaagurtytt-f, 
atcaaaaagg 
gaAgjip^rt^Ci 
tattcatgea 
taagagagag 
gaeLtfaaaaa 
tcttctgtcze 
gctaaaggag 
gaeh^qctt 
tgctgctgr.t 
aagaagcaat 
t^^t.gtattt 
gtet catena 
tatactgagA 
csctaaataaa 
aaageUicnq 
ggttcaacta 



> 36 
qttttatagg 
gLt.L:l.t.Al:L:i; 

gaaacattat 
**anncagata 
as cc ti^eal:g 
gcagtagaag 
qttagggaaa 
aAAgriHtfg tq 
cagcccaccc 

caagaagrat 
*jcttcatcaa 

gttattgatt 
ctgaagaccg 
r.LijjJrti^uba 
aa gtctggtg 
atacatcagc 
«i:Mflflqq*>tc 
gtggtr.ataa 
caaaacaaaa 
qtbocjtgagc 
gatgi^AgL<:a 
ctacgccagg 

caagaageLg 



atcacattga 
i^-t^st-curt 
atgatgaag r. 
aanagaaagc 
L i; y a r:t r;. 
aLgatcaagt 
gaaaaaagaa 

ctccctctga 
gcgatqacca 
t geeec Lcea 

LU^bbtqrtt 
tgcttaaaee 
aaactgacaa 
tyntrptt tt.c 
taatac-aaga 
ttcaagaaaa 

i| y t q t G 5 rjr^rt 

caaggatgaa 
tacatgtcag 
nqntggnggc* 

gCdMLTrtiJlMc: 

attatgetag 
tccaaaagan 
ftQaijanij-L) l:y 



agaccgay»rr 
<;tqggatgct 

agCCLg<j<-:q^ 

a^gttgttaa 
g^i^oiacaga 
aaa tnr^i^tgc 
t-ccacacc-yc: 
^<9c»ttttccg 
atctggcirnr- 
^tgacaaggc 
agt| l.tw^caa 
gggeggag^ 
ni:tttgtgaa 
L eg ctq cm & q 
caaciaaacaa 
ca.L^ia^ ij q q g 
tttataataa 

i^Cjgq.^qtgca 



caa^^gl.Mi:<: 
tottacagta 

agaaaagaa* 
tqngagtgta 
t gtd qt-t- 
ggaaaagaaa 
gattcctggc 

a^:dgi:%ijciD 

ggatiaaaaag 

gacagaaaaL 
ggataatget 

LyA«c^>qCa 

agaaaatgcL 
t rjaaqat gag 
t.qcz I: I: I: * o c»q 
ggacaagtta 
cittaacccag 
agaLtt^rtcttt 
ttatcaag^g 
agagatagct 
aaat.^cuictg 
gttggtg^.it- 
rrnntgetgag 
ggMMifnagtt 



tgggtggag.: 
1:t:tcccct.tc 
gccg^i<:g<:L 
Hijggagcgag 
agn^qr^rjtc 

tCCCCL?.yi:t: 

q^qgaagacg 
cccLi.b^tac 
tccaagtgag 
Crtttzqtgaag 
caaaca L^-m k 
qctggtgggg 
tgagirtyt^qc 
cgttgcagga 

g<-q«caaann 
gtcttgtLtL 
yyc^ttottttt 
gcccaaaAfl,"? 



acgcgL«j;iq? 
cctacctgct 
L^l-.hqq^cag 
tgaatggaa^ 

ttl.ytitqqtgg 

ctgccccL9<; 
ntgatgacat 



□tqqagtttt 
atttLL-Ctc:!: 
cagaaaagag 
w^q^a^nnnji 
cctcgagacrl: 
ccat tgaacg 

aaaaa&gtcig 
tcgaagaaga 
gLi^gAAA<:h<: 
aaacaagaaa 

y 1. 1 : K Kt:c? bc*< J 

g a ct caa gt^: 
gaagggatcc 
qdny t.gw flf^t. 
gctctttgtg 
oagbcoagta 
s:f:Cy i: l.y i :t q 
gaaatgatga 
ggancattag 

: 1. 1; d y y i jo 

gaaagcaagc 
gcigctgactg 
<r:A.igr.3yL:tw 
cagagctaca 



gggtcttc-cl. 
c-gc:gc*qqaag 
qgcacctgca 
gt-yyttnttC 

ccgtggggl-.c- 
t-caga ca t c n 
cglig^qocca 
agacactgyy 
ytrsqcatggag 
aagga^ l.l-.qf? 
qacaagtact 

caqijniTr?tqc 

tatgqagtcg 

Ahgqqct tgg 

caat a^ c 1. 

ycitoqagagt 

cacag^aLn.^ 



cttctccgcc 

qycihqcccta 

cgt gggt a r.^ 

gg ct gt a gee 
bqenqoagag 
gggat.l.tqqr.^ 



gc-e Ltr<:g rr^q 
gggggcaart 

qnqqr-qqacg 

gacgtcatLg 
gl.ri-.ritqct g 
gagaaga&ag 
ottt ttgatt 



atgagtcage 
tct ti^tgg&t 
aacaaaag^i 
agaaagaaat 
1. 1: rtrt a 1 1 * t<i 
tegt tgaaac 
tgcttgaaga 

aaccagggea 
tcntqqt ncc 
gtggaLcaijg 

^t'.qt.g^jl-.jaijfl 

agaaatctgg 
ttaaogat tt 
ttgtagAeLl 
agggagaatt 
tq£i3i?qc*c>qn 
cagagaaaga 
aaaaggaaca 
tgc^gatgaa 

MqqMM{irtK<3q 

agtctgeaga 
agaaaacagg 
ct u«y U I:gp9» 
tcaggaagag 



120 
3no 
240 
300 

4$0 
5*0 
600 

720 
VI3I1 
S40 

1070 



190 

420 
4ftO 



£0 

iyo 
2^0 

3 no 

360 
420 

SAO 
GOO 
660 
720 
VBIJ 
$40 
90 n 
$60 
1020 

1140 
1200 
1260 
1320 
1360 
1440 
1.501} 

ii>eo 
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caiiagcaaga kttacagaqt aaatl.L.qtgg 




aacag^qyna catg&ijiTcag 
bgaagtac-ay 
gc-a 
acaa* 

gttccattM cagatag c » 9 cccaga^tc cgcLhcaget rtagcagaa* aa tta*=*taa 
S™* a ? ga " *9«9 atflM * DcagaeLc |M gaUctttag .:«gtgaa ft g 
tgat.i^ttta acaagtaaag angaggaact taaggatai:* cagaatatga atri- cttatt 
a?h»2™« Jtg«g4«t tacayqccct ggcaaatgag *»gq«g t Lg ctgeat-tqa 
attgqagaag atgociacaaa ijtgtttatgh taaagaLgat aaaaLaeqat tdctaaaaaa 
gcaacta.; M catgaa»ttt caa MR aaaat gga^qaattt a^ottct^ StSa«"S« 




aataagaac* qaaaattcai, c^tctgacAnA ogaagtt.r:#ia 
tcjntcaqgtt tct.tttgcct nfcctagttga atjaacttaag aaagtgaU-r 
tggaaag^bo aagtcLytng aa<ja$r;ttct ggag^agaa cLd:tcaaag 
qgagaaaact ijttcaggatt. tqaasca^g* oataaaggcl. ctaaaagA*q 
tgLoi^ijctt gaaa^gq^tc aac^qttatc taLiiacttcc aa»qttc*agg 
cttatCaaMA qgaaaaga<jq aacagaLg** taccatgawy gctgttt.byq 
" — gtc: 9 a.tacag ^ynagtggtt acasgatctt cwnqaagaaa 

gttcegynsq tagcaiinnca taacttgd»p 
^ttgagattg Lyttgaaaga uHfirrggaaat 
gagagqqnq/a gt ga iicttt c tagcaa«Hi-n 
nao.ttgt.tLtf orjtcccaaaL fr.qaq cag c t i 
ettcuccctc at^MEjaatt attaaa»qta 
ccetggaatq agttBgal.ir^ tttgaagg^h 
c?aaaggcag<r: rt.aonggtgga aytrb^ttgag 



gagg ^ctgaatyi 
gactl.^nng ctd/icjcaasa 
ntgag*A.iQa 



aaaagcai-^t 
*jtttgaagaa 
catgiihocsaa 
agatgaaaAt:: 
ggc-vbcttct 
aataagLggt 
qa aaaaca a f. 
ca^oaaatta 
tggatLtgaa 
gr?ttctagag 
tga A^ntac 
tgttgagcwa 
acacratgcag 
taaggah.^tC 
ttqagatggad 
aac^citctica 
taaggcrtL:** 
tantqt tatt 

gtCtgtrtnrrt 

aca get cac a 
tcabttgagg 
tttca^uctt 
tngactactQ 



gaggcrchott 
y^ttgaaga 
cagcLqttnc 
» o acaa ca a a 

qcmgttgaa<: 
lil.qqnggcta 



Ll-.q eta a cad 
aaat.ayy oaa 
agett-cageja 
aagagwaga 
»tqaatct LI. 
ctQCwteaca 
qqttagaatji: 
aggAKqtaca 

actdC<Zdai:;j 

gayagaaaga 
a-cca gagg*o 
^qaagttct 



ttt<;enaagg £gLi:bqtccc ttctaatttg agtt«L:yqtg aal.ggbbgca 



aaaa£ggi;an 
i?acaagttga 
aaa Lccqtcc 
qaagaaaal:.^ 
Lc^toEittta 
gaaa a LfH 17 a 
rjqat eta cert 
aaaozbbagaa 
eagtcactgy 
gartflcitagtg 
ctaaatLz.irjQ 
aaggagaaag 

tctacLttgt 
zitttaaagaa 



aagaaLgl-.pb 
H*igo.agctga 
ttgttMgaaac 
aatggaaagl 
cahettcaga 
gaagaga^i-q 



qgetggaant. 
Lya^tgcac 
agaag^rt»bt 
taa^gtcgat 
a ca a y-.M qc t a 
q n a ca 1 1 1. y 



tcagggteay 
a Crn btgttac 
tt acaCftt/ioc 
g«#tcacaca 
gagcy»tt.oa 



nggaggLLa# 
ayetagagtg 
tacag^^/j^q 
»qactattaa 
gaagt:y^aEia 



^tqt"tacaga agl^pgagag 
ctgavi<:<ia&a tgaaaga<:nq 



r?aaatggaar tagaaaaggc 
ttg»^qqcac agi.t.^ciatga 



arjettatcca 
atgKttcDec 
ctgtaacai:M 

rtqonttaDCB 

caLLybcatta 
cagaaac^l: 
yrraaaaaaaa 



gtcctaaaata 
agaaac^^ciq 
Tttacagcag 
ggty htngag 
taLttt^i:ra 
qoaoagagtL 
aag<r:<:i>actc 



agttttntaa taanaactgt Ctganhnott agaccttL^^ 

aatctLLhat ettattthge fccaataaaat t. ijt tea gaa g 

gbaaaattta a^fctttaat acLyn^qttg tacactyt.tt 

aactg^t.QC: gacttcawr^t caagaaaatii i:bcttttgtt 

tacitggcgtc utjcaaataaa agejnt^ctta ttatLn 

<2lO> 37 

<211> 62G 

<312> 0NA 

<21:j> Honso aapietn.? 



a a ggt ag<r: l-.q 
gt.Hfloo.gctg 
C ettctgrtget 
l-Kq^ttcagg 
tgaacjLaatt 
nattaaagi:i: 

1-Hrrtcttt tC 

tgtay^yacc 
ot t cctga^y 
aLf:«yaqtgg 
tacttaiLBt. 
gtih^^tgtaa 



gtgattt^c.^ 
cLqrjdqacac 
aggagaca^t. 
ttgqtaaacca 
ggga^r^tgt 
ttatttst^U 
Lwntccttgt 
ttcrag^^ttt 
citaaacatgl- 

tttgggaayi 
ttgybtt ttg 



1620 
1740 

1320 
1 PPG 
20d0 
2100 
2160 
2220 
22BU 
£340 
X 400 
2*60 
2520 
2E.BQ 
2640 

27<30 

2PB0 
2LidD 
300 0 

3120 

enno 

324 0 
3300 

3420 
:i*no 

3Ei40 
3600 
jbbU 
3720 
li78D 
3940 
a TOO 
3960 
4020 

4140 

4 260 

4360 
441^ 



<22D> 

<222> 294. .264 

<223> n - o r c, g or 1 

<22(J> 

<221> unsure 
^ZZ2> 594 . . 

<223> rt = a^ cr, d or t 
<220> 

<22l> unaure 
<222> i>57_597 
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<223> a - d, g t 
<220> 

<221> un£U.rR 
5 <2V.2> 607. . £0'l 

<223> n - q, c, g oj- t 

<*t>0> 37 

-T^cacgagaa c^^tctaaa aaUjgnagcg l.rjaccagg*. ataaaaagt^ <igaaacl^l-c 60 

caaa **^* aga^btq cre t Lr,r,acc aac,» qa otaa a^Uqaaagt 120 

ggatcag^« iiqiiagaaagc l:toatcaaa<, aaa oaaas y « caqaaaatyt cttcciLan^t. 160 

S^cocctta ttCHtrjc^c tactl.*tatt cctLKqntgg al.^tgctga r.hcaagtcrt 240 

gtggt^ata agagaga^h tntttat1:U ? cttaaa^<:l.q accn^U*qa aggg*k<:csn 30U 



15 




360 

_ 9QT <**:<w*gt:tac l.<:<j.ct<ictgt 54 0 

ga^qaoqat gctgrt^ty caaaggal.^ qtgtaa^-.jiq ttaett:<:««q aatnal jn rii 600 
aagaaanart* qaq&aatttg ^tnt^caa " *™ 

<:U0> 39 
<2L1> 130 
<212> DMA 
21 3> Homo sapieu* 



25 



30 



35 



4Q 



.10 



55 



60 



65 



<220> 

<22l> udsiutb 

<222> 127. . 3 27 

<223> ri - a, c, £ or t 

<220> 

<221> unsure 

<222> 260. . ^60 

<22I» n - aj t:, y <lij; t 

<220> 

<221> unsure 

<222> 307, ,307 

<223> n - a r c/ n or t 

<220> 

<?21> unsure 

<222> 339. , 32$ 

<223> n - a, u, r? or t 

<220> 

<221> unsure 

<222? 336. . 336 

<223> n - c, g or t 

<220> 

<221> untUTM 

<:222> 4 73- - -173 

<222,> n - n, g Dr t 

<221> un.^ur-G 

<222> *87 . . 4ti 7 

<2£3> n - a, c, g or t 

<r?20> 

<221-' unsure 

<y.2Z> 434 . -4 94 

<22i> 3i = n r g or t 

<220> 

<22l> unsure 
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25 



30 



35 
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50 



55 



60 



65 



<222> n = a, c, y or t 



<220> 

<221> unsure 
<2Z2> 6d0. . «0 
<223> n - a, i:, 



<220> 

<222> fifU. ,681 
<223> n - a, 



q or t 



g or t 



-220- 

<;>31> unsure 
<222> 

<223> n - Mj c, g or L 
<220> 

<221> unsuiv- 

<£22> 703. .706 

<223> ii = a r c r g or t 

<22o> 

<221> uneur^ 

<222> 714 , , 7H 

<223> n - c? r c, g or L 

<22\)> 

^22 1> Ujisuti? 

<222> 726. .12$ 

^223> ii = a, Cj g or I. 

<22Q> 

<221> uridyl" e 

<222> 730. .730 

<223> n = a r c r g oj? t. 



A.^ w ra;aa 1 1 eg g££t:ij9rTtTn 



agatf.yyctt 
agactynrU.H 

qaaaganaga 
ggrA^ataat 
gatgaa-gtht. 
««atggnata 
ct g cray v act 
naacaggaaa 



ctgtccttga 
aaggagaagt 
aLryiiKtat nc 
gctgcy^cAji 
aijcnntgtgg 
gt atyy<:rtrt-.i 
cagcaagttc 

aaataaacta 
gctacagcaa 



^Ce^a^gttC ca a^lL«i:nc;g 



agaccgaa<ic 
»q act at gat 
cLggLijl:^at 
at ca get tea 
■^qqatcggtg 
ttataacany 

gtgAQtv^q^t 
gcagtcagca 
i?gr>ca(3gatt 
gaggaagt^:* 
nacgectgrcg 



Ly« i : n 3 n g a a 
gttttcL.qpa 
acaagatget 
flyrt^aaggac 

taagcaoj. k.i 
qattrnnagay 

ggnggcagag 
aoa^tscaaa 
atgctag^LL. 
.^qaagaatn 
agaa^yl-.qc^g 



aatyc:h£jciag 
gatgaggcU: 
1-tnnngaagt 
aa-gLywritcc? 
a-cccaggaa<4 

caagagatyi: 
m h.nr^cycact 
tcaaiil.qq^a 
qgtgaatgag 
cUijMycinngc 
ggaggcgnt.t. 



tgaagtttaa 
tt tgtgt tgt 

idfjy tAAqyi-| 

ctg-stgtgaa 
fc^atrjacaqa 
cattagaaaa 
ancaqntgea 

agcaagcagt 

agctactcay 
cagngcctac 



60 
120 
1PQ 
240 
300 
:-]hi] 
420 
4B0 

720 
7 30 



<210> 3& 

<3U> 571 

•C2l2> T>MA 

<213^ ]l<riiiw> sapiens 



<220> 

<22'1> uij^yro 
<222> 37. . 37 
<223> n - a, c, 



<220> 

^2^jl> unsure 
<222> ^42 
<223> n - a. 



g or L 



q or t 
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<220> 

<22l> unsure 
<222> 54, . G4 
5 <223> n - <z r q t?r t 

<220> 

<22I> unsure 
<1'ZZ> 67 . .67 
30 <223> n - * f cr or t 

<220> 

<221> umaure 
<222> 73-. 73 
L5 <2^:^> n - n, c r g or t 

<220> 

<22l> urusii m 
<Z22> 66. . 39 
20 <223> n - *>, g or t 

<222> 10C 10? 
25 <?.23> n = c r g c-r t 

<220> 

<222> 117. . 117 
50 <223> n - cs # g l>i- h 

<220* 

unsure 

<222> 

35 <223> n = a r c K g or t 

O20> 

<221> unsure 
<222> . 1S3 

40 <:223> n - a, c r g or t 

<:220> 

<221> unsure 
<22'2> 24:i. . 24/1 
43 <223> n - a r c, g or t 

<220> 

<?.?.!> unsure 

<222> 2i7..2.57 

50 <223> n - a, c r g or t 

<220> 

<221> unsure 
<?.2Z> 2§4 . . 264 

55- <22li> /i — n t r^, rj qt; t- 

<S20=- 

<22l> uji^ur« 
<222> 313. . 3ltf 
60 <223> n - a r c, g or t 

<220> 

<:221> unsure 

<222> 337.-337 

<22.*> n = n, c, g or t 

<220> 
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30 

<27Z1> unsure 

<222> 342. .3^1 

<223> rr. zi f c, q or t 

5 <220> 

<221> u/ititire 

<222> 353. _3S3 

<2£3> n = a, f:, u or t 

If) <220> 

<221> xmsure 

<222> 362. .362 

<22:s> n - a, c, n or t 

IS <'2?-0> 

<22l> unsure 

<222> 3?3. .373 

<223> n - a, ii, g or t 

20 <220> 

<22] > unaure 

<222> 379,. 37 9 

<223> n - m, c, g oz I. 

25 <220- 

<221> unsure 

<?23> - a, c r g or t 

30 <220> 

<221> unsure 

<222> 419. . 419 

<223> n ~ g or t 

35 <220> 

<221> unauris 

<L22?> 421, .121 

<223> n - a, c, 9 or L 



40 <220> 

<222> 427.-42? 

<:22 3> n - » r c r g ox t 

45 <22G> 

<221> uneui't 

<22^? 433.4 33 

<223> n - ** t c, g c>r t 

,10 <220> 

^221> uneuro 

<222> 440. . 442 

<223> n a, c, g or t 

55 <2^i)> 

<221_> unsure 

<222> 444^444 

<223> n - a, r. r g or t 

60 <220> 

< 2 2 1 > UIJ.S u 

<222> 44B. . 4411 

<2i23> n - a„ c, cj nr t 

65 <220> 

<2^l> unsure 
<"222> 4bU..450 
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<£21> unsure 
5 <222> ^ 55 - ,*55>* 

<223> n - i^j q oi t 

<220> 

'^'A2l> unsure 
10 <222> 4 0^,-43 6 

<223> n - a, r: r q or t 

<220> 

<221> imBUx* 
15 <2^> ABB. . 4B9 

^'223;* r> - K* r g or t 

<221> uiiHuro 

<223> n - c, g or t 

<2 2 1 > i jei.^ v to 
25 <222> . <iy7 

<223> n - a r c f g or t 

<220> 

■;'122j> unsure 

<223> n - 5, g ox I. 

<220> 

<22l> i>n.rp\irc 
35 <222> &O0 . . bnj 

n - a r c # ^ or t 

<220> 

<£21> unsure 
40 <222> 506.. 501 

<223> n — b i c r g or t 

<220> 

<^2D> unsure 

45 <222> 315. . SDS 

<-.2^^> n « a, c, g or t 

<22 1 > u n 5 urc 
50 <222> 520. .5^0 

<223> n - a, c, g or t 

<22C» 

<221> unsure 
55 <2'2'2> 523. . 523 

<223> ft — & f o, rj or t 

<221> uri.^urc 
GO- <222> 525. . 526 

<223> n - a, g or t 

<220> 

<2Jil.> unsur 
6S <222> 539, .53B 

<223> n — c, c>t- I; 
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:4O0> lis 



aayai^Ht^c 
tggnagng^l: 
teiactgtgtt 

ttCrtQggqqn 

tqnctggata 

ttAAL:1:tt9t 

gncttctgi.a 
ntctcgntct 

catcggcctt 



CttttttAh^ 

ctnt ctcctt 
acagU-tqnt 
a cat cacti. I. 
a tj i"; t ccnqtg 
tcagttcLn^ 
«cntagggng 
ccn^uctngn 
cgnagnntjgn 

t.MCIctttPCC 



Ptcaaa l:«<j;> 
i-<jtgrrmrii*jn 
ttagqcttnc 
tttcaatasn-: 
atl.rtt.tgagc 
<lttttgtgg y 
gl:r;n.ntccct 
nngm:aannn 



<2JD> 40 
<212> 

<]£l'.t> Homo sapiens 



rcagtti^c:i:i7 
sm nnnnnnnc- 
agncaLt.qtc 
tf^tagtgtcL 
ctta l.gcaaa 
tgtt tcr>t t 1. 

CllrttiTCCttt 

ccgcnaLe.tt 
t.qoanccggn 
t 



anttacrt i.l-tm 
<:^ccynnccg 
tccl.l.«.-l-.i!^9 
cccgcagct I. 
■tCdi"r elegit tot 
■nactgtgccy 

tcLfltjttWC 

tcgcttcLI. M 
H:nannsggc 



ct ctta c a lift 
wwtQQtnctg 
gaggcLcicjcjt; 
ttnctatttt 

CCttCty Kt:t- 

Hcnac-tctct 

ttti^.^nnrr 

ntteggtcta 

tUytrrtnnt t 



60 
12\) 
130 

420 

alio 

54 D 



<22Q> 

<22l> unsure 

<222> ^S. .435 

<2£3> n - a r c, y r?r t 

■C220> 

<221> unsure 

<222> . 520 

<223> it — a, i2, 17 ox t 



atALl.^tnrja 
cagcccagac: 
•nt.na.g-ca gat 
asgaagayijn 
aat tacaggtr 

tttcaaacaa 
tioiqttcagaa 
t ggaa a a # t.q 
ggacttattc 

aCrac-ay ;irjt t 



- 40 

gcagaatgag 
^tccgcttca 
aaaa Lrtig/t-ct 
acttaaggat 

CfiLgqi^aat 

tgttaaaga l. 
Mcit-qqaagaa 
gctacagjuit- 
cattcaagaa 

caagacttac 



yr^tttqaaag 

gt tcLfltirioiq 

gaagattctt 

gagcaggcty 
y.^ttffliaat a a 
tttaagal.L.i; 
cttgtttctg 

taaagagagg 



<210> 41 

<211> 56C 

<212> DNA 

'■123 3 > Horac- sapiens 



ctcaa^i 

L«y<:»^qtqa 
tgaatttel':!. 
L-I.yotqcacs 
gattgcl.Ljtj.^ 
tnnntgacca 
EBcaQiir-taa 
<lttaaagaLTl. 
£5^4- L-. g 3 1 q era 
tgatca-ggLL 



gc^gt tccat 
L.i.iiiqtgatt 
acrgt^a L<:9h 
:i ft a a a a get 

t gaa L l.y y hij 

.aqagcaacta 

^Eddi:HHni]r.n 

taaggatgtt 
y tqqnnqat t 
tagacag^d 
rjrtty 



LiintTCiq^t ng 

gcagai.iayi] 
tt-oncaagta 
gaagttjf:rt<jtt 
aagat gcaac 

t taanatcag 

acttgacatt 
t. r. t ct ct g 



60 
1 2f> 
180 
>4n 
300 

4 20 
430 
MO 
COO 
e>*4 



<220> 

<222> 
<:223> 

<220> 
<221> 
<22a> 
<223> 



11 nsure 
4 V . 4 7 
ri - a r 



unsure- 



q ox t 



<"221> ijr>?tujro 
<222> 91.. 91 
<2Z3> ri - a r Cj tj Or; I. 



<:220> 

<22l> unsure 

<223> n - ft f c, g or t 

^•'22 1> urifiurK 
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<222> 4 41 



442 



15 



2ft 



25 



5U 



35 



40 



45 



50 



S5 



(50 



65 



<220> 

<221> (jyjfsuT-c 
<:222> 448. . 44ft 
^22 3 > /t = a r c r 

<:232> 4 57. . 4b f 
^223? r> = Sir c, 

<220> 

<221> unsure 
<222> 525. . b2S 
<22:*> n - a, c, 

<220> 

<2^l > unsure 
<222> 

<223> n - a, i:, 
<220* 

<221> unsure 
<222^ 5f>4 . , 564 
<223> n - a r 



g ox b 



g or t 



g or t 



q or t 



<? or t 



Ctgllyi^ct-C-t 

oygt tgagcc 
ttl.<:,iantcc 
ctcttcrta^A 
A <jnaggcaa 

*gbqtcaaga 



tcasggtfrl: I. 
tgaacaaggg 

tttcctttLa 
it ttcaaqct. 
cgaacagLl. L 
<K>ct tgatct. 

i£;q cir-Jc:c:t.-:n.T: 
qtt Dttattg 
a,£LI:i:t.ti?ca 



r;aaactggga 

Ct Lyu^OAQa 

ctttctrt L I. 
ntnnqttcyg 
■ggBCBLLLiir- 
Knt octtgtt 
ttcat<: be- 

ncatttgnta 
cagnct 



r.g^Mijnngag 
ntcsaLy ynt 
ot-cttccaad 
aag^l.<:ctga 
tntttcttct 
ag^acst tg 
tcatggatca 
gbttt-anqct 
ctcttcrbt.t-.,-} 



^ti^ fcntntca 

t C-t J1 ga rf grttj 
#c:^q&ettca 
act L Uj q ci o rj 
tttagagect 
a^aa^l.q^og 
otctcttaag 

CQrfWtfttCtt. 

ggtgnca<:;:a 



agtl..-»tqtng 

CCCgt3ca<;<:n 
L *3 U T 0 1 1 C3 1 

tg a l a h 
I. K<Jt ttcccg 
CCtCCHgn.iq 
cjbet tea act 
ttgL<:^qnga 
naataaggnc 



SO 
120 
2BD 
240 
sou 

430 
!S40 
566 



<210> 42 
<2I1> 878 
<21K> DNA 

<213> JlDtnn> sapiens 
<220> 

<221> uri£ur& 
<2?.?.> 595. . 

<223> n - g or t 

^"22 1> uriMi?rc 

<222> 756,-JSfc 

<223> n - a r c r g or I. 

<22t» 

<221> unsure 

<22'?.> 76G. . 766 

<223> n - o, g or t 

<222> 615- .BIS 

<223> n = a r g or t 

<220> 

<"'221> unsure 



BNSDOCID: <WO OQ20597A2fTL> 



WO 00/20587 



34 



PCTyUS99/22S73 



<2?.2> 81B.-SLG 

<22 3 > ^ - a, g iir t 



<2 20> 

<221> undujpn 
<222> 827- -627 
<223> n tt ar c# 



g or t. 



<22tt> 

<221> UAAiiro 

<222> . B:n 



<£20> 

<22J > u-n^nre 
<222> _F?^3 

<223> ii - a, c: r 



*220> 

<22l> unsure 
<222> . 
<22D> n - a r c, 

<40D> &2 



atgcaacnra t 
gagaacttct 

ttctttctci. 
otqttggtag 

ggcttctaac 

taaaagattc 

agttctgaag 
acnttcctaa 

caacctttct 



cctctgacetl: 
tonccataac 
t La qcz<z t-. noci 
tgttcaactg 
citit.fcotgaaa 
ttttigtt^tt 

ctctLiia.^tt: 
qaagaggcct 
att LLL>ttct 
ttccsaaaca 
r.l t^tgaact 
ttrctccttt. 
gcaactttga 



q or t 



13 or t 



qwaqttccag 

actcaacagc 
i-atvcttcaa 
ttactttl.rt^ 
Cun^ctgcLc 
gtgtl:l. I:c ? c:t 
cattttcctt 

tgaagatcc-t 
Q<T:<T: L I-. ril ci g g 

nnttfcagcct.t.a 

gAAAh h<?qqc 

tttccnaaag 



<2LU> 
<211> 620 

<213> lloiiKi 



<22Q> 

<221> unsure 
<222> 391. , 391 
<222> n - d, c , 

<220> 

<221> itfkftuT-tt 

<222> 400. .4011 

<223> ji = a, c r 

<220> 

<221> unsuiB 
<22K> 411, .411 
<223> n - c, 

<220> 

<22I> uneut« 
<222> 414 * . dl* 
<223> n = a r Cji 



g or t 



g or i. 



g or t 



g or t 



ccataiiMttc: 
ngggacagac 

Lb:i;;jcct?|t 

agaatcLaA*: 
t**attcttca 
aat t Lyyqac: 

agaaagatca 

t "L C: I: Ltr^^cic 

atgttgtgcrt 

gt h tj i: i : h rct t 

tattcatct^ 
attccn^l. I.K 



ttttqccttr 
rffir:ht.t-gqaa 
tgctgccl*. t.L: 
tcnttccaga 

ttanacaatt 
CLcLi:i:i?tct 

ccctqtattg 
ttL^tctttt 
ttgagccctL 
ccincmnccga 



Ivttt-caatcc 
ata^sLLL-.t.tt. 

DOLttgttttt 

ga^rt^-ttnt 
aagaagaiL^n 
U)lr.1:ttcat C 
cttttagt^L 
yl-.fc.nt t caaa 
catgtgtl. I: h 

ccttttsaad 
tr-oagcctgg 
a a crag t.l: I . I. r? 
nttQatctXt 



60 
120 
nan 
240 
son 
360 

480 
540 
&00 

720 
760 
BOU 



<220> 

<221> unaure 



<222> ^ie. . dip 

<-?.?,3> n - a, c, <^ or t 

<220> 

$ <221> unsure 

<222> 421. ,421 

<220> 

tO <?-Zl> xinsure 

<222> d27, , 428 

<22 3> n - a r e r y or t 

<220> 

15 <221> unsure 

<222> ii - g or t 

20 <221.> unsure 

<222> .00b 

<223> n = a r c r g or t 

<:220> 

25 <221> unsure 

<222> 'iiifl . .455 

<233> n = a, Oj g or t 

<220> 

<222> Jtfli. . 0fi3 

<223> n - a r c, g or M 

<220> 

35- <221> unsure 

<222* <J65_.46±> 

<223> n = c, 9 oi t 

<220> 

<222> All. .471 

<22'S> n - q or t 

<220> 

45 <2i2l> ur^-uro 

<222> 530. . flBI) 

<223> ii - dt r o f ci or t 

^220> 

50 <Z2l> unsure 

<222> 4H4.,4g4 

<223> n - a r c:, d or L 

<220> 

55 <221> unsure 

<222> . ISO 

<223> n = n, o, g or t 

<220> 

GO <221> unsure 

<2'2'2.> 4 93. .4 93 

<223> n - a, cr, g or t 



<220> 

65 <221> unsure 



<222> 5{)0, . 500 

<223> rj - a, Cj g or t 
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LO 



15 



<22i>> 

<2?. 1 > uruure 

<22%> . .515 

< 2 S ?i > ii = 3j c, cj or t 

<220> 

<221> onftorv 



^222? 



<223> n - d, r. r 
*.220> 

<Z21> unsurfe 
<^-l> n - a r ^. 



g or t 



q or t 



20 



<221> 
<222> 
<2S3> n - 



unsurt 



25 



30 



35 



40 



45 



50 



55 



60 



65 



O.V.I > unsure 

<.2?.~3> n - a, c r q or t 

<22t)> 

■^2il> unsure 

tl = a r c, g or l. 

< 22 1 > u rv.mxo 

<222> fi0<]. , 600 

<223> n - a, Vj ci or t 

<220> 

<22I> urssu.r& 

<222> 604 . . 60a 

<223> n - a, ^ t 

<Z20> 

'".V£L> unsure 
<VZ2> 60S ..COB 

ji -a, c, g or t 

<22l> unsure 

<222> 616. , 516 

<22.-s> ii = a, c, g or t 

<M^1> iijiBLire 
<^K2> 619.. 613 
<2x.":?> 33 ■= a, c, 



g or t 



acctt.a^c^t 

tgagaa cx U: 

tttctttctr: 
cctgttggta 
cctgtacat.f: 
nggggtnncra 
csangggters 
ctttaagstc: 



43 

uctctgaccc 
ttcaccataa 
tttagcctcc 

U^tttctgaa 
■lttttgttgt 
utqirnacagc 

*U.U.<r:li;tg 



tgaattccag 
ctcaaattag 
sactcaacag 
gcatccttca 
attactttta 
ttaagctgct 
tgtgttttgc 
atacanttcc 
gcccctttca 
aagatccngt 



Cf:al.ACLaL-t O 

aagg^ai^qtj 

aagaAL.iUw*i 
3taat.t.C:l. Lij: 

caatl:l:^i^2^i 
naag^^i^Un 
tttnrjctl:!..^ 
agtr>nggr.i.^ 
acnnttcccg 



ttttgccttt 
oocctt tgga 
ttgctgcctt 
est cattccag 
ut^aggggga 
ctt aaacaat 
□ ct ccccctc 

i-gOTacttcc- 
»>«ttggt El^q 



ttttcasatt: 
aataattttr. 
tcattgttr-t: 
agaccactt.A 
aaagaagaf.g 
ttgtftt^ai. 
nccnttanrra 
naggttcctzn 
tgaacatgng 
cttcoctt.cn 



60 
120 

3<10 
360 
420 
430 
!>4 0 
«0U 



BNSDOCID: <WO 0020587 A2TL> 



WO 00/205*7 



37 



PCT/US9M2573 



ttCTiaaanag crttcvmqgng ^ 20 

•C210> 4 4 
<211> 62-J 

<213> [tonio snpiens 
<Z20> 

<22l> unsure 
10 <222> 25,-25 

<223> n - u, r;, g or t 

<220> 

<221n unsure 

<223> n - g or t 

<220> 

<221> uneux-it 
20 <222^ 412. .412 

<223> n = ^ c r cj or l. 

<320> 

<"22 1 > it n^u rs? 
25 <222> 414. .414 

<223> ri - a r ^ iir t 

<22D> 

<22l> unsure 
30 <222> 423,-42:* 

<.223> n = a, c, g or t 

<220> 

35 <222> 42 6. . d2ti 

<223> n = a r c, g or t 

<220> 

<221> unMt>rG 
40 <222> 435. .43S 

<223> n = a, c f g ox t 

<22D> 

<221> urt^yrrj 

<222^ 444 . , 44* 

<223> n - 3, c, g or t 

<220> 

<2'2l> unsure 
50 <222> HSt>. . d50 

<223> n - a r c t g <*r 

<220> 

<221> uneur* 

S5 <2J?2> 454 . .454 

<223> n - 7i 2 c, ^ or t 

*220> 

> unsure 
60 <222> 4b9. . 4 59 

^223> ^ - a, o r g or t 

<220> 

<221> unsure 
65 <222> *90. . 490 

<223> n - &, C 4 \j ryr t 



BNSDOCID: <WO 0020587A2TI_> 



10 



IS 



20 



25 



3D 



35 



40 



45 



WO00UOS87 FCTYLJS99/2M73 

<220> 

<22'1> unsure 

<222> 50a..5DG 

<223> n — a r c, q or t 

<220> 

<22l> unsure 

<222> bll..511 

<2.2 3> n - a r <i f ij or t 

<22D> 

<22L> unsure 

<222> 513. . 515 

<223> n — a, f;^ g or t 

<220> 

<221> unsure 

<222> 524 . .524 

<:223> n = a, Cj g rtr t 

<220> 

<22l> vnsure 

<222> 54D.,iJ5 

<22\l> n = a, c r g <*r t 

<220> 

<22l> un.?ure 

<222> . 

<22:i> n = a r g or- 

<220> 

<22l> unsure 

<222* i>6fl . .5M 

<2 23> n = a, c, g ox h 

<120> 

<22'1> UAiur^i 

<222> 60D- . 600 

<222> t-- * r c, -g or t 

<22n> 

<222> 611. . 611 

<223> n = a, g or t 



<4 0D> 4 4 

accttaac**: cctotgaccc tgaantcunij ccntacattc tLL:«.ij^cttt ttttcaaa l:<: 60 
c=*tgcaacca tteeier-fltaa otcaaattag aeyygoMija cacctttgy* aotaattttt 120 
tgagaacttc tttagcctcc aafcl^acflci cttccatcrtg ttqctqcctt tccLLyijanrr 190 
tcttgttcac tttqtcctgc agcattttLL ^qttgntgc caatgcktcc nttgcttccc 240 
agtttttctc ccgaaggtr:* ttqtttttct. tcctcLgytrg ttcaactgca Lnuttcaaag 300 
aa t ct_/sa ot c a± t ccaqa ga cmc h ttt ct ttctctc L 1: t r^tnjaaat t a crt 1 1 bp 3 sc- 
at t cttc a 1. y agggggaaaa gaagat^t^l: qgtgggaatt. tggl^ttTKi ancnggccaa 4 2D 
ttnggnacrt aaa»n*t-ttci gttncaactn gtata* W c?ng taaaaactgt yttbtgctaa 4G0 
55 aaagshoacn ccccctctcL tJ.awgocmTn nttcnaaceL <:t.tncittcat ttccrtttL^I: .54 0 
tcaanaaaLt tc^ryntctca aacngggatg uHHfljrjnggc tcttcaagl.t *a<3gtggGcn 60 0 
An=ttccggaa natijgc^ttt aan G23 

<210> 45 

60 <2ll> 625 

<"212> DNA. 

<S13> Komo sapiens 

<220> 

<22i > unsnx 

<222> . 392 

<223> n - a, q l>i- t 
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<220> 

<2Zl> unsure 
<222> ,409 
5 <223> n - a, c f *j or t 

<22D> 

<221> unsure 
<Z2X> 417. .-417 
™ <223> h t. y r Cr g Dr t 

<?.?.0> 

<221> urtrturr? 
<222> 421. .421 
f5 <223>n=a, c, goxr. 

<220> 

<222> unsure 
<222> 424. .42$ 
20 <223> n - & r <f or t. 

<220> 

<221> unaure 
<2^X> ,43"? 

25 <:223> n - a, i: r y or t 

<22tt> 

<221> unsure 

<22?.> 439. . 4 33 

3fl <223^ n - a, l, g a: i: 

<22D> 

<22L> unaure 
<222> 4 SR.. J &6 
^3 <223> n - a r c, g or t 

<220> 

<221> unsure 

<222> 0 67. ,4 SI 

40 <22 3> n - aj c r g or L 

*220> 

<27"l> unsure 
<Z22> 4KB. r 459 
45 <223> n - a, c, g or t 

<220> 

<221> unsure 
<ZZ2> <73, .473 
50 <223> n - i;, y or t 

<220> 

<2Zl> unsure 

<222> 4 VS.. 475 

<223> n - a, c r g t 

<220> 

<221> unsure 

<222> 49(i_49G 

60 ^223> n - a, ^ r g or t 

<220> 

<221> unsure 
<222> 499. .500 
65 <223> 3i = ^ y r q or t 

<220> 
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<22l> ensure 
<222> . 

<22.*$> n - a, c, ij or t 



<220> 

<2^>> unsure 

<222> $24 . . 52H 

<223> n — g or" t 



<22D> 

<221> unsure 

<22J2> 535. . WS 

<223> ii = a, c, g or t 

<2'20> 

<221> un.^uro 
<222> 536- 

<2j*3> n - a, c r g ar t 
<220> 

<22L> uii sure 

<222> 5tin_,560 

<223> n - a r y or t 



<220> 

<22J> unsuie 

<222> 30-2-.-, 5B2 

<223> n - a, g r>r t. 

■c220> 

<^2l> ynsure 

<£23> n - aj £ r g t 



<22D> 

<221> unsure 
<222> 5&4. . 

<223> n - a r g or I. 
<220> 

<22l> unsure 

<222> tOfl . .604 

<223> n. - a, c r g w I. 



<400> dii 



catgceacca 

tetter hot ij 
tttctttctc 
cctgLt^gta 

□tttafifcrjrtt 
ttcncttcaa 



cctctgaccc 
fct caeca taa 

gtgttcaact 
I'-^tttctgaa 
gttt-t^t hyt 
ctggtsacag 
c a ftnc<?n tna 

catttcttcn 
^anngectea 



LqMayt r;r-crg 

ctDaaattag 
aactcaacag 
gcafcciiLttro 
attactLtta 

ctgtgtLkhy 
aattacattt 
qcrctctttca 
Lgeaa LL-.c:ijt 
aggta 



<210> 46 
<211> 4410 
<212> DM^i 
<213> Komo 



ccatacattc 
& a y rjrjn cq ga 
Dttcrcaci;tT 
aagaatctaa 
ataAktcttc 
caatttgggia 
wtagaaaag 
cccl:Kr»tctt 
a gttan.gr. 1.9 



GSCCLLLgq^i 

ttgetgeett 
rtl:t:.itt-ccag 
atgagggg^A 
rittnnacaat 
a£.tai;tr^ncc 

aaacaanant 
tqcnacttDC 



aataattttt 

t.r^Tttgtttt 

dQdCcactlia 
antigaagatg 

ctctctnttt 
Ctnan^tt ot 
tgaajia i.fihij 
angricttccc 



50 
120 
LBO 
24 0 
300 
3 CO 
420 
430 

GOO 
G25 



<40D> 46 

gccgcacagg gttttatagg atcacattga Crt« .i ^c?tacc atggagttl:*. cttq^tcagc GO 

atattttatt gttotcnttc cttcaatagt tattacdybf* attctcctct tcLLcl:gqc.t 120 

tttcabgaaa gaaacatt&r. wtgatgAAyt tettgeaaaa, CAij^mnagng aacaaaagct iftft 

tattccia^: *>aaacagata aaaagaaagc a^w^angaaa aagaat^daw ^ttancjaaat 240 
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ccagaatg^a aaoctccatg *ntccgactc tgai^qtgta c r.t orraga c L ttaaattatc $00 
*yntgctttg gcAQtarraag atgat.zajqt tgcacctgkt DDattgd*tg tcgttgaAac 360 
ttcaagtagt gttagggaa* gaaaaaagaa ggAanagaaa <^aaagcctg tqcttgaaga 420 
gcaggtcati^ ^nagaaagtg a^atcaaa gatt*r.l:ij(jc aaaa^gtag aacctgti?cc 460 
agttactaaa crageucaccc ctccciicbija ngcagctgun tcgaagaaga aaccagggca 54 0 . 

gaA^aaqtct aaaaatgg** qcgatgacca gga hciaaaag glijrroaactc ti*ra hcjcjtacc 6D0 
atcaaaaa^g caagaagcat tgcm^tcca ccaaga^ct asaraag,*.^ gtggatc^iyq 660 
ynngaagaaa gcttreatcaa agaaaraaA* gwcntjaaaat ytcfctcgtag aKq^ncccct 720 
tat LOflt-qca actacttatA fctcctttgat ggaLaatrjct gact<:«agtc ctgtggt.arra 780 
taagagagag gfct<»ttgatt tgctt^Aji^ tqaccaagta gaagggatcrcr aqa^atctgg B*Q 
g^tnaaaaa ctgaagaour] nnnctgacaa agAAv^frgct gaa^l.qciaqt ttaaag*fcfct 900 
ccttttgLcc ttgaagacta t g a f.qttt t d tgaagatyaq gctcttLgLg ttgtagactt 

gctaaaggag aagtctggtg taatAr:»oq a tgctttaaag aaqtcaagts ayqnragaatt 102<J 

c?nctacgctt atacat^gc ttcaagaaaa g^a^aagtta cl_i:qcitgctg tgaatjgaoqa 1080 

tgnl^Qtcrct acaaaggaL<: gqtqtaagca gttaat:<:caq gaaatgal.y* cagagaaaga. mo 

aagaagcaak gtqqttataa caagyatrriui agatcgaa LI: qqaacattag a^qqanca 1200 

tantgtattt caa^A^aaa tscatgtcag ttah^nngag a ct a gn tgcagatgaa "i2$0 

gtr.tvaqcna gttcgtgagtt eyatggaggc agagatar^t cacttgaagi: aggaaaatgg 1320 

tatactgaga rrntgcagtca gcaAC*<;tnci nnatcaactg ganngcaagc agtcrtqcsga 1360 

actaaataaa cLsi=qccngg attatgctag gLhgqtgaat gagel.gactq agaaaacagq 14 40 

aaAgr.tffcag caagaggaag t.r^aaaagaa gaatgcLg*q cnngcagcta ^hcagttgaa 1500 

ggt tea acta c-ancraagctg agagaag^Lg yqaagaagtt cag^yctaca tcaggaa^wg I5fi0 

aar^qeggaa catgaggc«g nacogcaaga tttacayayt aaatttgtgt* cwaagaaaa 1620 

tgaagtaeay ^gtctgeata gtaagethac: »qatnccttg gl-a-k^annc aacagtegya "ifcgo 

gonaagacta atgLagtt-aa tggaatcaga gcagaaangg gtgaacaaay aagagtctct 1740 

acaarthqeng gttcaggata L L h tqqj rrcn gaatgaggct h t gaaaqct c a a a LLowqr^a 1B0O 

gttccattcc ^egatngcag cccagacctt ^g t tcagt i: ctagi^y^q nattacataa IB 60 

ngtqattgca gaaaiigtfBt.Js aqcagataaa acagacLgwa qnttctttag caaghga^cq 1920 

tgatcgLLha acaagtaaag aagaggaai:L fc a a ijcjat ata cagaatatqc* atttcttatt i960 

□ aaagctgaa gtgwg^yt tacaggccct ggcaaal.y.>q caqgctgctg <r;l-yi;arjfl.tqa 204 0 

attgyag^q atgcaacaaa gt g h h.t.a t.gt. t<anngatgat aaa*hf]agn.t tgctggaaga 2100 

gcaactacaa catgooattt caaacaaaat gga a ga a ttt aagatLctaa At^r^M 21 GO 

£ha^ ijccitta aaatcagaag LLi:aga.ar?ct acagactctt gl: Khc^.qanc agccLaataa 2220 

ggatgtt^Lg qanoaaatgg aaaaatgeat tedflq^ana gatgagaagt ha^rtqactrjt 2280 

q<jaagaatta cttgaaA^ttq gacttattca ggtggcaatL a.T.ir|a3Lga<ig agetgaatge- 21 AO 

aata^an^ci gaaaattcat ctctgacraaa Aga.^gttcaa gacttaa^g rt^cjeaann 24 00 

tgntcaggtt tcLtfctgcct ctct^ttga agaactlr^nay anaqtgatcc atgagaaaga 24 

tgga.a^qatc aagtctgtag aagagcLtnt. yg^ggcagaa cttcrtcaA^g ttgctaacaa 2520 
ggagaaaacL gttcaggatt tgaaacagga aataAAg<jc.t Dtaaaagaag aaaCatjgM« 

tgtccagctt gaaaaggcti: M«wqttatcs tatcacttcc M«qttcn.gg agcttcagaa 264 0 

cttalit^san ggaaaagagg aacagatg^ haccntgaag gctgttttg^ «c*ga*Tflartga 2700 

gaaagaccta gftr^atacng ggaagtggtt acaggaliefcl. caagaagaaa atgaatcttL 27 £0 

aaai^trcacat gttcsggaa^ i-Mgi^^flc^ taacttgaaa gaggc^:l;^tt ctgcatcaca 2820 
gtttgaA^o*^ cttgagattg tgttgaaaga i&rtqqannDt gaattgaaga gg h l:nq»ngc 

c^itgctaaaa gagac^qg^qa citgatctttc tagcaaaa^a c:a^tqttac aggatgtaca 2^-10 

agahqnaaac aaattgttta agtr:i;i^aa t tgagcagect aaacaa^A^« pn-fcaccanca D000 

ggcatd:hct ttcccccctc atgaagaatt att^wqbn atttcagaaa gagayAaaga ^n^Q 

aataagtggt cLctg^atg agttagnttc tttgaagg^L gragttrjaac accagaggaa 3120 

gaaaaaccaat gaaaggcago ^ftc^ggLiyga agntgttgag ttggaggctr.A ^aqacigttot 3160 

caaaaaatta tttccaaagg tgtctgtccc ttctaatkfcq agttatggtg aatggttgca S24t> 

tggatttgaa aaA£3yqrj:cta aaqaotgtat ggctggaact tcagqqtoag aggaggttaa 3300 

ggtfectagag cacaagttga AAgA^gczLgii Lq/inntgcac acattgttaL: ag^t^rragt-g 3360 

tgaaaaaLAi^ a.ao.tccgtcc ttgeagaaac agaaggaatt- ttacagaagc tacagagaag 3020 

tgttgagcaa gAAQaaanta aatggaaagt taaggtegat g^<itc«y^Cin agactattaa 3-4 BO 

ac^qntgeag- tcatcatLta ^at^ttrj^g^ tmaagagcta gagcgatta« yaagc-qaaart 354 0 

taagcjatfltt gaaaatctga gaagagaacg agaaoatttg rjciaatggaac tagaaaaggc 3600 

agagatggaa cgrttctaoct ntgttocaga agtcagagag tLgAaygo.^ agttaaatga 3^60 

fi^tt^Dtcaca aaacttagaa ctgaai^adAa li^^gacag aaggtagctg gtgatttge-a 37 2ti 

taaggct^^ cngtcactgg agcttatcca gtcaa^L^ gtaaaflrjctg ctggagacac 3730 

tactgttatt gaaa^t^gtci ntgtttcccic agaaaeggag tcttctgagw. .^qqagi^aat 39^0 

gfcvtgtaagt ctaaatoaga ctgtaacaca gtha-^gcag ttgettcagg eggtaaacca :i*500 

acagcrti^a^c? aaggagaaag agcactacca ggtgttagag 1 y a a gtn.oit t gggaaactgt i960 

tcatttgagg afcaart./*Mqq cattcTtattn tattttgeca aattaaagee l;i.fltit.^Kgt $Q7.0 

ttLtt«ccctt tctactttgt cagaaacact gaa^ag^qtt ttqtcttttc taatccttgt 4080 

tagactactg ^tttaaagaa ggaaaaannn aagccaactc t^tagar.ai^c; ttcagagttt 4140 

agttttataa taaaAACl'.qt ttqnntaStl: agacctttac attcctgaag AlLAAa^Afc^l. 4 200 

Afltcrbtttat etcattttge L^aataaaat tgt liE:oqaag atcaaagtgg taaagacaat 4 260 
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crtaaaattta wttttaat acL^tqfctq tacact^tt tacttuanflt HHL^a^nt 4320 
a attest ct gacttcaacb caaeiasaaca clitttttgtt: *i:t-*atgtaa tcggtttttg d3B0 
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Database EMBL 

Entry HSAA24402; Accession number AA211485 
02.02.1997 

zn55c06.sl Stratagene muscle 937209 Homo 
sapiens cDNA clone IMAGE: 562090 3' similar 
to TR:E245548 E245548 Chromosome XII 
Reading frame ORF YLR097C 
Hillier, L. et al . ; in relation to SEQ ID 
No. 4 

XP002140895 

see the abstract 

SEMENKOVICH C F ET AL: "Human fatty acid 
synthase mRNA: tissue distribution, 
genetic mapping and kinetics of decay 
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vol. 36, no. 7, July 1995 (1995-07), pages 
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GANSLER, T. S. ET AL. : "Increased 
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survival " 
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NAKAMURA , T. ET AL.: "A novel 
transcriptional unit of the tre oncogene 
widely expressed in human cancer cells" 
ONCOGENE, 

vol. 7, 1992, pages 733-741, XP000670282 
see Figure 3; in relation to SEQ ID NOs. 7 
and 18 


bb , bo , 
60,68, 
69,71, 
72,75 
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RICH, B.E. AND STEITZ, J. A. : "Human 
acidic ribosomal phosphoproteins PO, PI 
and P2: Analysis of cDNA clones in vitro 
synthesis and assembly " 
MOLECULAR AND CELLULAR BIOLOGY, 
vol. 7, 1987, pages 4065-4074, XP000909977 
see Figure 1C; phosphoprotein P2; in 
relation to SEQ ID No. 35 


55,56, 
60,68, 
69,71, 
72,75 
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SEKI, T. ET AL.: "Cloning of the cDNA 

encoding a novel import in alpha homologue, 

Qipl: Discrimination of Qipl and Rchl from 

hSrpl by their ability to interact with 

DNA helicase Al/RecQL" 

BIOCHEMICAL AND BIOPHYSICAL RESEARCH 

COMMUNICATIONS, 

vol. 234, 1997, pages 48-53, XP000910159 
see Figure 1 

& Database EMBL; 10.04.1997; Entry 
HSAB2533; Accession number AB002533; in 
relation to SEQ ID No. 35 


55,56, 
60,68, 
69,71, 
72,75 
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JIANG, C. ET AL.: "Characterization of 
colorectal cancer related cDNA clones 
obtained by subtractive hybridization 
screening " 

JOURNAL OF CANCER RESEARCH AND ONCOLOGY, 
vol. 123, 1997, pages 447-451, XP000909986 
see the whole document as far as related 
to phosphoprotein P2 
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A 


WO 96 29409 A (UNIV LEIDEN ;LUDWIG INST 

CANCER RES (US)) 

26 September 1996 (1996-09-26) 

the whole document 


1 


A 


SAHIN ET AL: "Human neoplasms elicit 
multiple specific immune responses in the 
autologous host" 

PROCEEDINGS OF THE NATIONAL ACADEMY OF 
SCIENCES OF USA, US, NATIONAL ACADEMY OF 
SCIENCE. WASHINGTON, 
vol 92 1 December 1995 (1995-12-01), 
pages 11810-11813, XP002091914 

ISSN: 0027-8424 
cited in the application 
the whole document 
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SCANLAN, M.J. ET AL.: "Antigens 
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patients with renal -cell carcinoma" 

INTERNATIONAL JOURNAL OF CANCER, 

vol. 83, 12 November 1999 (1999-11-12), 

pages 456-464, XP000867569 

see whole document in relation to SEQ ID 

Nos. 1, 4, 5, 7, 12, 15, 16, 18 and 35 of 

the application 

& Database EMBL, 06. July 1999; Entry 
AF155110; Accession number AF155110 
Scanlan, M.J. et al . ; in relation to SEQ 
ID Nos. 1 and 12 

& Database EMBL, 06. July, 1999; Entry 
AF155114; Accession number AF155114; 
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Box I Observations where certain laims were found un earchable (C ntinuation of item 1 f first sheet) 



This International Search Report has not been established in respect of certain claims under Article 17(2)(a) for the following reasons: 

1. [xl Claims Nos.: 

because they relate to subject matter not required to be searched by this Authonty, namely: 

Although claims 78, 79, 81-88, 90-107 are directed to a method of treatment of 
the human/animal body, the search has been carried out and based on the 
alleged effects of the compound/composition. 

2. * Claims Nos.: x ., , . . , 
because they relate to parts of the International Application that do not comply with the presented requirements to such 

an extent that no meaningful International Search can be carried out, specifically: 

see FURTHER INFORMATION sheet PCT/ISA/210 



^— ^ because^mey are dependent claims and are not drafted in accordance with the second and third sentences of Rule 6.4(a). 



B x It Observations where unity of invention is lacking (Continuation of item 2 of first sheet) 



This International Searching Authority found multiple inventions in this international application, as follows: 



see additional sheet 



1 . I I As all required additional search fees were timely paid by the applicant, this International Search Report covers all 
' ' searchable claims. 

2. Qj] As all searchable claims could be searched without effort justifying an additional fee, this Authority did not invite payment 

of any additional fee. 



3. [77] As only some of the required additional search fees were timely paid by the applicant, this International Search Report 
LX-I covers only those claims for which fees were paid, specifically claims Nos. : 

1-12, 14-17, 19-22, 24, 26-35, 37-44, 46-50, 52-69, 71-79, 81-88, 90-107, 
109-116 (subject 1, 4, 6 and 22 of non-unity motivation) 



4. f - ) No required additional search fees were timely paid by the applicant. Consequently, this International Search Report is 
restricted to the invention first mentioned in the claims; it is covered by claims Nos.: 



Remark on Protest 



j | The additional search fees were accompanied by the applicant's protest. 
| x j No protest accompanied the payment of additional search fees. 
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Claims Nos.: 1-4, 14-17, 20, 30-35, 39-41, 57-59, 75, 78, 79, 81, 83, 
84, 98-101, 104, 109, 110, 112-114 



1. The terms NA group 1, NA group 2, etc. and PP group 1, PP group 2, 
etc. are applicants own reference signs and have therefore no recognised 
meaning. Consequently, all claims in which these terms are used are 
unclear within the meaning of Article 6 PCT. For the purposes of the 
search the terms have been interpreted as shown in the description, pages 
14 and 15. 

2. Present claims 1, 2 (as far as points (g) - (i) are concerned), 3, 4, 
14-17, 20, 30, 39-41, 57-59, 75, 78, 79, 81, 98-101, 104, 109, 110, 
112-114 relate to an extremely large number of products and methods 
defined inter alia by reference to a desirable characteristic or 
property, namely in the case of 

1) claims 1-4: "specifically binds to the nucleic acid molecule, an 
expression product thereof, or a fragment of an expression product 
thereof complexed with an HLA molecule" 

2) claims 14-17 and 20, 78, 79, 81, 83, 84 and claims 98-101 as far as 
they relate to claims 14-17: "enriches selectively the presence of 
complexes of an HLA molecule and a human cancer associated antigen" 

3) claims 30-35, 39-41: "binds selectively to a polypeptide" 

4) claims 57-59 and 75: "of sufficient length to represent a sequence 
unique within the human genome" 

5) claim 104: " inhibits the expression or activity of the protein 

6) claims 109, 110, 112-114: "bind to one or more MHC molecules presented 
on the surface of the cells which express an abnormal amount of the 
protein" 

The above mentioned claims cover all products and methods having or using 
a compound having this characteristic or property, whereas the 
application provides support within the meaning of Article 6 PCT and 
disclosure within the meaning of Article 5 PCT for only a few or even 
none of of these compounds or methods. In the present case, the claims so 
lack support, and the application so lacks disclosure, that a meaningful 
search over the complete claimed scope is impossible. Independent of the 
above reasoning, the claims also lack clarity (Article 6 PCT). An attempt 
is made to define the compound or method inter alia by reference to a 
result to be achieved. Again, this lack of clarity in the present case is 
such as to render a meaningful search over the claimed scope impossible. 
Consequently, the search has been carried out for those parts of the 
claims which appear to be supported and disclosed, namely: 
group 1) of claims: antibodies reacting with a peptide encoded by SEQ ID 
1 and nucleic acids hybridizing to SEQ ID No. 1 

group 2) of claims: polyppeptides encoded by SEQ ID No. 1, host cells 
expressing them, complexes of said polyppeptide with HLA, nucleic acid 
having SEQ ID No. 1. . 
group 3 of claims: antibodies and fragments thereof reacting with 
polypeptides encoded by SEQ ID No. 1 

group 4 of claims: complete SEQ ID No. 1 ... ccn 

group 5 of claims: antibodies reacting with a polypeptide encoded by SEQ 



page 1 of 2 



ID: <WO__0020587A3_I_> 



International Application No. PCT/US 99/22873 



FURTHER INFORMATION CONTINUED FROM PCT/ISA/ 210 



ID No. 1 

group 6 of claims: no search 

The applicant's attention is drawn to the fact that claims, or parts of 
claims, relating to inventions in respect of which no international 
search report has been established need not be the subject of an 
international preliminary examination (Rule 66.1(e) PCT) . The applicant 
is advised that the EPO policy when acting as an International 
Preliminary Examining Authority is normally not to carry out a 
preliminary examination on matter which has not been searched. This is 
the case irrespective of whether or not the claims are amended following 
receipt of the search report or during any Chapter II procedure. 
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This International Searching Authority found multiple (groups of) 
inventions in this international application, as follows: 

1. Claims: 1-12,14-17,19-22, 24, 26-35, 37-44, 46-50, 52-69, 
71-79, 81-88, 90-107, 
109- 

116 all partially as far as they relate to SEQ ID N 
os. 1 and 12 



Pharmaceutical compositions or isolated molecules comprising 
a molecule having the sequence of SEQ ID No. 1 (cl. 42, 43, 
55) or the corresponding protein (cl. 48, 68) and HLA 
binding fragments (cl. 56, 71, 72, 109), an expression 
vector comprising said DNA molecule (cl.61) and host cells 
comprising the expression vector (cl. 64) as well as related 
antibodies (cl . 115). 

Fragments of SEQ ID No. 1 which are long enough to represent 
a sequence unique within the human genome (cl. 57, 75). 
Kit for detecting the presence of a SEQ ID No. 1 molecule 
using nucleotide fragments of SEQ ID N0.1 (cl. 76) 
Method of treatment consisting essentially of increasing the 
immune response to a molecule encoded by SEQ ID NO. 1 (cl. 
78, 85, 88, 102) or in the administration of an antibody to 
such a molecule (cl. 95) 

Methods for determining regression, progression or onset of 
a condition characterized by monitoring SEQ ID No. 1 or 
encoded molecules, fragments, etc. (cl.5) 
Pharmaceutical compositions comprising an agent that 
enriches selectively the presence of complexes of HLA 
molecules with polypeptides encoded by SEQ ID No. 1 (cl. 14) 
and use in a method of treatment (cl . 98) 
Composition comprising an agent binding to a polypeptide 
encoded by SEQ ID No. 1 molecule and fragments (cl. 30-34) 
or a conjugate thereof (cl. 39). 
Method of treatment by inhibiting expression of a 
polypeptide encoded by SEQ ID No. 1 molecule (cl. 104). 
Method of diagnosing using an agent binding to SEQ ID No. 1 
or encoded molecule (cl.l) 



2. Claims: 1-10,12, 14-17, 19-22, 24, 26-33, 35, 37-44, 46-50, 
52-69, 71-79, 81-88, 90-107, 
109- 

116 all partially as far as they relate to SEQ ID N 
os. 2 and 13 



essentially idem as for subject 1, but related to SEQ ID 
Nos. 2 and 13 



3. Claims: 1-10,12,14-17,19-22, 24, 26-33, 35, 37-44, 46-50, 
52-69, 71-79, 81-88, 90-107, 
109- 
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116 all partially as far as they relate to SEQ ID N 
os. 3 and 14 



essentially idem as for subject 1, but related to SEQ ID 
Nos. 3 and 14 



4. Claims: 1-12,14-17,19-22, 24, 26-35, 37-44, 46-5G, 52-69, 
71-79, 81-88, 90-107, 
109- 

116 all partially as far as they relate to SEQ ID N 
os. 4, 5, 15 and 16 



essentially idem as for subject 1, but related to SEQ ID 
Nos. 4, 5, 15 and 16 



5. Claims: 1-10,12,14-17,19-22, 24, 26-33, 35, 37-44, 46-50, 
52-69, 71-79, 81-88, 90-107, 
109- 

116 all partially as far as they relate to SEQ ID N 
os. 6 and 17 



essentially idem as for subject 1, but related to SEQ ID 
Nos. 6 and 17 



6. Claims: 1-12,14-17,19-22, 24, 26-35, 37-44, 46-50, 52-69, 
71-79, 81-88, 90-107, 
109- 

116 all partially as far as they relate to SEQ ID N 
o. 7 and 18 



essentially idem as for subject 1, but related to SEQ ID 
Nos. 7 and 18 



7. Claims: 1-10,12,14-17,19-22, 24, 26-33, 35, 37-44, 46-50, 
52-69, 71-79, 81-88, 90-107, 
109- 

116 all partially as far as they relate to SEQ ID N 
os. 8, 9, 19, 20 



essentially idem as for subject 1, but related to SEQ ID 
Nos. 8, 9, 19 and 20 



8. Claims: 1-10,12,14-17,19-22, 24, 26-33, 35, 37-44, 46-50, 
52-69, 71-79, 81-88, 90-107, 
109- 
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116 all partially as far as they relate to SEQ ID N 
os. 10, 11, 21 

essentially idem as for subject 1, but related to SEQ ID 
Nos. 10, 11 and 21 

9. Claims: 1-9,12,14-16,19-22, 24, 26, 28-31, 35, 37-42, 44, 

46-48, 50, 52-54, 69, 71-79, 81-88, 90-106, 
109- 

116 all partially as far as they relate to SEQ ID N 
o. 22 

essentially idem as for subject 1, but related to SEQ ID No. 
22 

10. Claims: 1-9,12,14-16,19-22, 24, 26, 28-31, 35, 37-42, 44, 

46-48, 50, 52-54, 69, 71-79, 81-88, 90-106, 
109- 

116 all partially as far as they relate to SEQ ID N 
o. 23 

essentially idem as for subject 1, but related to SEQ ID No. 
23 

11. Claims: 1-9,12,14-16,19-22, 24, 26, 28-31, 35, 37-42, 44, 

46-48, 50, 52-54, 69, 71-79, 81-88, 90-106, 
109- 

116 all partially as far as they relate to SEQ ID N 
o. 24 

essentially idem as for subject 1, but related to SEQ ID No. 
24 

12. Claims: 1-9,12,14-16,19-22, 24, 26, 28-31, 35, 37-42, 44, 

46-48, 50, 52-54, 69, 71-79, 81-88, 90-106, 
109- 

116 all partially as far as they relate to SEQ ID N 
o. 25 

essentially idem as for subject 1, but related to SEQ ID No. 

25 

13. Claims: 1-9,12,14-16,19-22, 24, 26, 28-31, 35, 37-42, 44, 

46-48, 50, 52-54, 69, 71-79, 81-88, 90-106, 
109- 
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116 all partially as far as they relate to SEQ ID N 
o. 26 



essentially idem as for subject 1, but related to SEQ ID No. 
26 

14. Claims: 1-9,12,14-16,19-22, 24, 26, 28-31, 35, 37-42, 44, 

46-48, 50, 52-54, 69, 71-79, 81-88, 90-106, 
109- 

116 all partially as far as they relate to SEQ ID N 
o. 27 

essentially idem as for subject 1, but related to SEQ ID No. 

27 

15. Claims: 1-9,12,14-16,19-22, 24, 26, 28-31, 35, 37-42, 44, 

46-48, 50, 52-54, 69, 71-79, 81-88, 90-106, 
109- 

116 all partially as far as they relate to SEQ ID N 
o. 28 

essentially idem as for subject 1, but related to SEQ ID No. 
28 

16. Claims: 1-9,12,14-16,19-22, 24, 26, 28-31, 35, 37-42, 44, 

46-48, 50, 52-54, 69, 71-79, 81-88, 90-106, 
109- 

116 all partially as far as they relate to SEQ ID N 
o. 29 

essentially idem as for subject 1, but related to SEQ ID No. 
29 

17. Claims: 1-9,12,14-16,19-22, 24, 26, 28-31, 35, 37-42, 44, 

46-48, 50, 52-54, 69, 71-79, 81-88, 90-106, 
109- 

116 all partially as far as they relate to SEQ ID N 
o. 30 



essentially idem as for subject 1, but related to SEQ ID No. 
30 



18. Claims: 1-9,12,14-16,19-22, 24, 26, 28-31, 35, 37-42, 44, 
46-48, 50, 52-54, 69, 71-79, 81-88, 90-106, 
109- 
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116 all partially as far as they relate to SEQ ID N 
o. 31 

essentially idem as for subject 1, but related to SEQ ID No. 
31 

19. Claims: 1-9,12,14-16,19-22, 24, 26, 28-31, 35 .37-42. 44, 

46-48, 50, 52-54, 69, 71-79, 81-88, 90-106, 
109- 

116 all partially as far as they relate to SEQ ID N 
o. 32 

essentially idem as for subject 1, but related to SEQ ID No. 
32 

20. Claims: 1-9,12,14-16,19-22, 24, 26, 28-31, 35,37-42, 44, 

46-48, 50, 52-54, 69, 71-79, 81-88, 90-106, 
109- 

116 all partially as far as they relate to SEQ ID N 
o. 33 

essentially idem as for subject 1, but related to SEQ ID No. 
33 

21. Claims: 1-9,12,14-16,19-22, 24, 26, 28-31, 35, 37-42, 44, 

46-48, 50, 52-54, 69, 71-79, 81-88, 90-106, 
109- 

116 all partially as far as they relate to SEQ ID N 
o. 34 

essentially idem as for subject 1, but related to SEQ ID No. 
34 

22. Claims: 1-9,11,12,14-16,19-22, 24, 26, 28-31 34, 35 

37-42, 44, 46-48, 50, 52-54, 69, 71-79, 81-88, 

90-106, 

109- 

116 all partially as far as they relate to SEQ ID N 
o. 35 

essentially idem as for subject 1, but related to SEQ ID No. 
35 

23. Claims: 1-12,14-17,19-22, 24, 26-35, 37-44, 46-50, 52-69, 

71-79, 81-88, 90-107, 
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116 all partially as far as they relate to SEQ ID N 
os. 36-46 



Pharmaceutical compositions or isolated molecules comprising 
a molecule having the sequence of SEQ ID Nos. 36-46 (cl. 42) 
or the corresponding protein (cl . 48, 70) and HLA binding 
fragments (cl. 71, 72, 109), an expression vector comprising 
said DNA molecule (cl.61) and host cells comprising the 
expression vector (cl . 64) as well as related antibodies 
(cl. 115). 

Fragments of SEQ ID Nos. 36-36 which are long enough to 
represent a sequence unique within the human genome (cl. 57). 
Kit for detecting the presence of a SEQ ID Nos. 36-46 
molecule using nucleotide fragments of SEQ ID Nos. 36-46 

(cl. 76) . . . +K 

Method of treatment consisting essentially in increasing the 
inmune response to proteins encoded by SEQ ID Nos. 36-46 
(cl. 78, 85, 88, 102) or in the administration of an 
antibody binding to the protein encoded by SEQ ID No. 36-46 
(cl. 95) 

Methods for determining regression, progression or onset of 
a condition characterized by monitoring SEQ ID Nos. 36-46 
molecules or encoded fragments, etc. (cl. 5). 
Pharmaceutical compositions comprising an agent that 
enriches selectively the presence of complexes of HLA 
molecules and molecules encoded by SEQ ID NOs. 36-46 (cl.14) 
and use in a method of treatment (cl.98) 

Composition comprising an agent binding to proteins encoded 
by SEQ ID Nos. 36-46 (cl.30, 32, 34) or a conjugate thereof 

(cl. 39) . 

Method of treatment by inhibiting expression of a protein 

encoded by SEQ ID Nos. 36-46 (cl. 104) 

Method of diagnosing using an agent binding to SEQ ID Nos. 
36-46 molecule or protein encoded (cl.l) 
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